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ABSTRACT 

Schemes  for  quantization  and  fusion  in  multi-sensor  systems  used  for  discriminating 
between  two  sequences  of  dependent  observations  are  introduced  and  analyzed.  The  observa¬ 
tion  sequences  of  each  sensor  under  the  two  hypotheses  are  arbitrary  stationary  dependent 
sequences  that  can  not  be  modeled  as  signal  in  additive  noise;  the  objective  of  the  fusion  center 
is  to  disrciminate  between  the  two  hypotheses.  These  observation  models  are  well  motivated  by 
practical  multi-sensor  target  discrimination  problems.  Two  cases  are  considered:  in  the  first,  the 
observation  sequences  of  the  sensors  are  individually  dependent  but  jointly  mutually  indepen¬ 
dent;  in  the  second  case,  the  observation  sequences  are  dependent  across  both  time  and  sensors. 
The  dependence  in  the  observations  across  time  and/or  sensors  is  modeled  by  m  -dependent,  <j>- 
mixing,  or  p-mixing  processes.  The  following  four  quantization/fusion  schemes  are  considered: 
(a)  forming  test  statistics  at  the  sensors  by  passing  the  observations  through  memoryless  non- 
linearities,  summing  them  up,  and  fusing  these  test  statistics  without  previous  quantization;  (b) 
quantizing  uniformally  (with  equidistant  breakpoints)  each  sensor  observation  and  then  fusing; 
(c)  quantizing  optimally  each  sensor  observation  and  then  fusing;  and  (d)  using  the  sensor  test 
statistic  of  (a)  to  make  binary  decisions  and  then  fusing  the  binary  decisions.  To  guarantee 
high-quality  performance,  a  common  large  sample  size  is  employed  by  each  sensor  and  an 
asymptotic  analysis  is  pursued.  Design  criteria  are  developed  from  the  Bayesian  cost  of  the 
fusion  center  for  deriving  the  optimal  memoryless  nonlinearities  of  the  sensor  test  statistics  and 
the  sensor  quantizer  parameters  (quantization  levels  and  breakpoints).  These  design  criteria  are 
shown  to  involve  an  extension  of  the  generalized  signal-to-noise  ratio  used  in  single-sensor 
detection  and  quantization.  The  optimal  nonlinearities  and  quantizers  are  obtained  as  the  solu¬ 
tions  of  linear  coupled  or  uncoupled  integral  equations  involving  the  univariate  and  bivariate 
probability  densities  of  the  sensor  observations.  Numerical  results  based  on  simulation  are 
presented  for  specific  cases  of  practical  interest  to  compare  the  relative  performance  of  the  four 
quantization/fusion  schemes  described  above  and  to  establish  their  superiority  to  schemes  that 
ignore  the  dependence  across  time  and/or  sensors  in  the  observations. 


This  research  was  supported  in  part  by  the  Office  of  Naval  Research  under  contract  N00014-89-J-1375  and  in  part  by  the 
Systems  Research  Center  at  the  University  of  Maryland,  College  Park,  through  the  National  Science  Foundation’s  Engineering 
Research  Centers  Program:  NSF  CDR  8803012. 


I.  INTRODUCTION  AND  PROBLEM  FORMULATION 

In  most  of  the  recent  works  on  multi-sensor  detection  with  a  fusion  center  (see  [l]-[5]),  the 
observations  are  assumed  to  be  independent  across  time  (by  being  modeled  as  i.i.d.  sequences 
of  random  variables)  and  sensors  (through  the  assumption  that  the  observation  sequences  of 
the  different  sensors  are  mutually  independent,  when  conditioned  on  a  particular  hypothesis 
being  true),  both  of  which  are  intended  to  make  the  analysis  tractable.  Yet  the  observations  are 
generally  dependent.  Indeed,  the  observation  processes  of  the  sensors  become  dependent  across 
time  for  each  sensor,  when  the  sampling  rates  increase,  and  correlated  across  sensors,  when  the 
locations  of  the  sensors  are  close  geographically. 

In  [6],  the  problem  of  distributed  detection  of  a  signal  in  continuous-time  correlated  additive 
Gaussian  noise  is  addressed  and  the  optimal  decision  test  for  each  sensor  derived,  for  a  fixed 
fusion  rule;  the  results  are  limited  to  Gaussian  noise  statistics  and  a  signal-in-additive-noise 
configuration.  In  [7],  a  fusion  scheme  with  a  two-bit  fuzzy  decision  at  each  sensor  is  considered  for 
the  case  of  a  single  observation  with  dependence  across  sensors;  this  is  an  interesting  formulation 
but  not  a  sufficiently  general  one.  In  our  work  of  [8],  we  treat  multi-sensor  detection  problems 
with  correlated  observations  across  time  and/or  sensors  of  general  m-dependent  or  mixing  type. 
However,  although  the  decision  rules  of  the  sensors  were  coupled  through  the  optimization  of 
a  common  cost  function,  no  quantization  or  fusion  of  the  sensor  decisions  or  observations  was 
considered  in  [8].  In  our  other  recent  work  of  [9]  (a  companion  paper),  quantization  and  fusion 
schemes  were  considered  in  a  multi-sensor  context  for  the  purpose  of  detecting  the  presence  of 
weak  signals  in  stationary  dependent  noise. 

In  this  paper,  we  consider  quantization  and  fusion  in  multi-sensor  systems  for  the 
discrimination  between  arbitrary  observation  sequences  under  the  two  hypotheses 
which  are  dependent  across  time  and/or  sensors.  This  work  complements  that  of  [9] 
and  extends  it  significantly  by  considering  non-weak-signal  models  for  the  observations  of  the 
sensors.  In  particular,  the  observations  sequence  of  each  sensor  under’ the  two  hypotheses  are 
arbitrary  (not  necessarily  signals  in  additive  noise)  stationary  dependent  sequences.  The  weak- 
signal  model  of  [9]  and  the  associated  locally  optimum  detection  approach  are  not  applicable 
here;  a  nonlocal  detection  approach  is  adopted.  For  all  the  quantization  and  fusion  schemes 
of  interest  to  this  paper,  we  consider  two  operational  scenarios,  one  in  which  the  observation 
sequences  of  the  various  sensors  under  each  hypothesis  are  characterized  by  dependence  across 
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time  only  and  another  characterized  by  dependence  across  both  time  and  sensors. 

In  our  model,  the  sensors  collect  n  observations  each  (n  being  the  common  sample  size)  and 
either  form  test  statistics,  which,  after  quantization,  are  transmitted  to  the  fusion  center,  or 
directly  transmit  to  the  fusion  center  a  quantized  version  of  their  observations.  At  the  fusion 
center,  the  information  sent  by  the  sensors  is  processed  further  and  a  decision  is  reached  as  to 
which  of  two  hypotheses,  Hq  or  Hi,  is  true.  Here  we  introduce  and  analyze  four  quantization 
and  fusion  schemes.  In  all  four  schemes,  the  sensors  employ  memoryless  nonlinearities,  while  the 
fusion  center  conducts  an  appropriate  likelihood  ratio  test  (and  a  different  one  for  each  scheme) 
with  a  threshold  that  can  be  easily  determined.  These  schemes  are  described  in  detail  later  in 
this  section. 

To  model  the  dependence  in  the  observations  of  each  individual  sensor  and  between  observa¬ 
tions  of  different  sensors  we  use  stationary  mixing  processes,  such  as  stationary  m-dependent, 
^-mixing,  or  p- mixing  processes.  The  definitions  of  these  mixing  processes  and  the  associated 
central  limit  theorems  are  detailed  in  [10]  and  [11].  These  models  of  dependence  have  been 
successfully  used  in  single-sensor  discrimination  problems  (see  [12]- [15] ). 

As  a  rule,  the  optimal  detection  schemes  under  the  above  models  of  dependence,  for  ei¬ 
ther  single-sensor  or  multi-sensor/fusion-center  configurations,  involve  high-order  (larger  than 
two)  probability  densities  of  the  observations,  which  not  only  are  difficult  to  characterize,  but 
also  impose  prohibitive  requirements  on  processor  memory  and  complexity  for  the  storage  and 
processing  of  the  dependent  data.  To  avoid  these  complications  in  our  analysis,  design,  and 
implementation,  we  employ  suboptimal  sensor  decision  statistics,  and  sensor  quantiza¬ 
tion  schemes  based  on  memoryless  nonlinearities;  the  fusion  rules  employed  at  the  fusion 
center  are  basically  likelihood  ratio  tests  based  on  these  suboptimal  sensor  decision  statistics  or 
quantization  schemes.  These  are  easier  to  implement  and  their  derivation  requires  knowledge  of 
only  the  univariate  and  bivariate  probability  densities  of  the  observation  sequences.  Large  sam¬ 
ple  sizes  are  necessary  for  the  validity  of  the  analysis  and  the  usefulness  of  the  resulting  detectors; 
but  these  are  well  suited  to  hypothesis  testing  problems,  for  which  a  high-quality  performance  is 
desirable  involving  the  ability  to  discriminate  between  two  sequences  of  observations  with  very 
similar  statistical  description.  Our  analysis,  which  is  based  on  a  nonlocal  detection  approach,  is 
valid  under  large  sample  sizes  for  any  statistical  relationship  between  the  observation  sequences 
under  the  two  hypotheses;  yet  it  is  the  case  involving  close  similarity  in  the  statistical  descrip- 


2 


tions,  for  which  our  approach  is  most  appropriate,  as  high-quality  discrimination  necessitates 
large  sample  sizes.  For  single-sensor  systems,  these  memoryless  nonlinearities  were  successfully 
used  in  [12]-[13]  for  nonlocal  detection  (discrimination)  problems  with  ra-dependent  or  mixing 
stationary  observations,  and  in  [14]- [15]  for  designing  optimal  quantizers,  by  considering  the 
quantizers  as  special  nonlinearities. 

We  now  describe  the  four  schemes  (termed  Schemes  1  to  4)  for  quantization  and  fusion  in 
multi-sensor  detection  systems  with  dependent  observations,  whose  analysis,  optimal  design,  and 
performance  evaluation  (via  simulation)  constitute  the  subject  of  this  paper.  In  our  description, 
we  focus  attention  on  a  two-sensor  configuration;  the  extension  to  configurations  with  more 
sensors  is  discussed  in  a  later  section.  First  we  introduce  the  notation  for  the  model  of  the  sensor 
observation  sequences,  the  memoryless  nonlinearities,  the  test  statistics  used  by  the  sensors,  and 
characterize  the  necessary  probability  distributions  for  the  statistical  description  of  our  multi¬ 
sensor/fusion-center  system. 

The  distributed  discrimination  problem  considered  in  this  paper  is  formulated  as  the  following 
distributed  binary  hypotheses  testing  problem: 

H{0k)  : 

H[k)  :  X,-(fc)  ~  f$(Xlk));  k  =  1,2  (1) 

where  =  (A^,  •  •  • ,  Xn ^)  is  the  vector  of  n  stationary  observations  of  sensor  k  and 

the  associated  n-th  order  multivariate  density  of  the  observation  process  for  sensor  k  under 
( i  =  0, 1).  The  above  model  of  the  sensor  observations  is  motivated  by  practical  problems  of  naval 
discrimination  between  targets  (ships)  and  decoys  (chaff  or  active  decoys).  As  is  shown  below, 
only  the  univariate  (marginal)  and  bivariate  (second-order  joint)  pdfs  of  the  sensor  observation 
sequences  are  involved  in  our  analysis.  The  observations  sequences  are  not  modeled  as  signal 
in  additive  noise.  As  mentioned  already,  the  two  observation  sequences  (k  —  1,2) 

are  modeled  by  stationary  m-dependent,  or  mixing- type  processes  such  as  <£- mixing  or  p-mixing 
processes.  As  discussed  above,  we  are  interested  in  worst-case  situations,  in  which  the  statistical 
description  of  the  observation  sequences  under  the  two  hypotheses  are  similar,  because  large 
sample  sizes  are  then  necessary,  if  high-quality  discrimination  is  desirable.  For  example,  if 
Ei[x[h^]  —  £o[A^]  and  E\[(X^)2}  =  i?o[(.xj^)2]  {k  =  1,2)  (i.e.  the  means  and  the  powers  of 
the  observations  under  the  two  hypotheses  are  equal),  a  large  sample  size  is  required  to  achieve 


3 


small  error  probabilities  at  the  fusion  center. 

Let  f]k(  )  be  the  memoryless  nonlinearity  for  sensor  k.  In  general,  the  argument  of  5/.  is  a 
continuous-amplitude  (real)  variable  and  gt-  can  take  all  real  values.  (In  reality  g t  is  implemented 
through  a  discretized  form  with  a  large  but  finite  number  of  amplitude  levels,  a  matter  to 
which  we  will  return  later).  When  quantization  is  employed,  gk(-)  takes  the  particular  form 
of  a  quantizer  characterized  by  a  finite  number  quantization  levels  and  breakpoints.  The  test 
statistic  for  each  sensor  k  basically  has  the  form 

Tn,fc  =  -X>(X,(fc));  A:  =1,2  (2) 

n  1=1 

where  n  is  the  common  (large)  sample  size,  and  a  stationary  observation  sequence 

characterized  by  m-dependent  or  mixing  type  dependence.  These  mixing  processes  are  described 
in  detail  in  the  tutorial  [10];  we  will  not  repeat  the  definitions  here. 

In  general,  the  true  distribution  of  Tn%k  (k  —  1,2)  is  difficult  to  obtain.  However,  for  large 
n,  one  can  employ  the  central  limit  theorem  for  the  aforementioned  mixing  processes  (see  [11]). 
Specifically,  under  hypothesis  ( k  =  1, 2;  i  =  0,1),  define 


k'k^idk)  —  ■E't'Pn.fc])  k  —  1,2 


mk,i 


,i(9k)  =  van[gk(X{ ')]  +  2  covi[gk(x{  ’), gk(Xf^)] 

3= 1 

where  uar,[-]  and  cov{[-]  are  the  variance  and  the  covariance  operators,  respectively,  and 


(3) 

(4) 


(51,52)  = 


Ei[(Tn<i  —  Hi^i)  •  (T„,2  —  fj>2,i)] 


(5) 


(^crhia2,i) 

where  the  numerator  of  (5)  is  equal  to  ^[covi{gl(X^),g2{X^))}+2  j2,‘  cov,{gi{x[V;),g2{ X ) } ] 

mk,i  ( k  =  1,2)  is  the  range  of  dependence  under  H-k\  for  the  m-dependent  observations  of  the 
k- th  sensor;  and  is  the  corresponding  parameter  for  the  cross-dependence  of  the  obser¬ 

vations  of  sensors  1  and  2.  For  mixing-type  observations  we  take  mq,-  — *  00,  for  l  —  1,2  and 
(1,2).  Clearly,  under  hypothesis  Hjk\  gk,i(gk)  depends  on  the  univariate  pdf  of  the  observa¬ 
tion  sequence  denoted  by  fk,i(-) ;  i(<7fc)  depends  on  fk,i  and  the  bivariate  pdfs  of 

(X^xj^),  for  j  =  1,2,--  -,  denoted  by  /^(-,-);  and  5,(51,52)  depends  on  f2,i  and  the 
bivariate  pdfs  of  (xj^xj^),  for  j  =  0, 1,2,  ••  denoted  by  •)•  The  primary  distinction 

between  the  above  definitions  for  the  multi-sensor  discrimination  problem  treated  in  this  paper 
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and  the  problem  of  multi-sensor  weak-signal  detection  considered  in  [9]  is  that  the  variances 
under  the  two  hypotheses  are  generally  different  in  this  paper,  whereas  they  are  the  same  in  [9]. 

According  to  the  central  limit  theorem  for  mixing  processes,  under  (i  =  0, 1)  and 
suitable  conditions  (see  [10]-[11]),  if  i  converges  (which  is  trivially  satisfied  for  m-dependent 
processes  but  needs  to  be  assumed  for  mixing-type  processes)  and  i  >  0,  (Tnjk  —  Hk,i)  I 
is  asymptotically  Gaussian  distributed  with  the  standard  A/"(0, 1)  pdf.  The  conditions  for  the 
validity  of  the  central  limit  theorem  for  each  individual  type  of  mixing  processes  are  described 
in  [11]  and  are  assumed  to  be  satisfied  in  this  paper.  For  example,  p-mixing  processes  (for 
which  the  numerical  results  in  this  paper  are  generated)  require  not  only  the  convergence  of 
the  infinite  series  above  and  the  strict  positivity  of  the  variance,  but  also  the  convergence  of 
the  series  pf\n,  where  p,-)ri  denotes  the  sequence  of  p-mixing  parameters  under  hypothesis 

H^‘,  the  superscript  (7)  is  (1)  for  the  observations  sequence  of  sensor  1,  (2)  for  that  of  sensor 
2,  and  (1,2)  for  the  jointly  considered  observation  sequences  of  sensors  1  and  2. 

Henceforth,  we  suppose  that,  under  hypothesis  ( i  =  0, 1),  Tnik  of  sensor  k  is  asymp¬ 
totically  Gaussian  distributed  with  mean  fJ.k,i(9k)  and  variance  cr^t(p^)/n.  Moreover,  the  pair 
(Tnti,  Tn)2)  is  asymptotically  jointly  Gaussian  distributed  with  the  correlation  coefficient  p;(pi ,  (J2) 
(— 1  <  Pi(pi,P2)  <  1).  For  the  sake  of  notational  convenience,  we  omit  the  arguments  of  the 
means  p^,-,  the  variances  a *it-,  and  the  correlation  coefficients  p,-.  We  also  use  (jk  and  TU)k  to 
represent  the  nonlinearity  or  quantizer  and  the  associated  test  statistic,  respectively,  in  the  gen¬ 
eral  expressions.  We  then  use  different  notation  to  emphasize  their  function  in  the  different 
quantization  and  fusion  schemes  considered  in  this  paper.  Similar  notation  rules  are  applied  to 
the  means,  variances,  and  correlation  coefficients. 

The  four  quantization  and  fusion  schemes  central  to  this  paper  are  defined  as  follows:  In 
Scheme  1  (illustrated  in  Fig.  C.l),  the  k- th  sensor  employs  the  nonlinearity  g k,  forms  the  test 
statistic  Tnik  given  by  (2),  and  transmits  it  directly  to  the  fusion  center,  where  a  likelihood  ratio 
test  is  performed  based  on  Tnik,  for  k  =  1,2.  If  one  could  reliably  transmit  a  real  number  through 
a  bandlimited  channel  (as  are  the  channels  between  the  sensors  and  the  fusion  center),  this  fusion 
scheme,  which  is  optimal  within  the  class  of  schemes  employing  memoryless  nonlinearitiies  in  the 
sensor  test  statistics,  would  also  be  practical.  However,  in  reality  we  can  only  transmit  a  finite 
number  of  bits  of  information  through  the  aforementioned  channels.  Therefore,  quantization  of 
the  test  statistics  or  the  sensor  observations  themselves  is  of  interest  and  this  is  considered  by 
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the  following  three  schemes. 

In  Scheme  2  (illustrated  in  Fig.  C.2),  the  observation  xfk^  of  sensor  k  at  time  l  is  first 
quantized  by  a  uniform  quantizer  <%,  which  is  obtained  from  the  discrete-form  of  the  optimal 
nonlinearity  gk  of  Scheme  1,  and  then  transmitted  to  the  fusion  center,  where  the  test  statistic 

Tn,k  =  -f >pr/fc))  (6) 

n  ;=i 

is  formed.  To  distinguish  the  special  nonlinearity  (a  quantizer)  and  the  associated  statistic  of 
Scheme  2  from  those  of  Scheme  1,  we  use  gk  and  Tn<k  for  representing  them.  Finally,  the  fusion 
center  performs  a  likelihood  ratio  test  based  on  for  k  =  1,2.  This  scheme  is  motivated  by 
the  need  to  reduce  the  information  that  the  sensors  transmit  to  the  fusion  center.  It  is  simple  to 
implement,  because  its  breakpoints  are  uniformly  spaced  over  the  interval  defining  the  support  of 
the  sensor  observations;  for  a  large  number  of  quantization  levels  it  is  supposed  to  approximate 
Scheme  1. 

Scheme  3  is  similar  to  Scheme  2,  except  that  the  quantizers  are  now  optimized  and  can  be 
obtained  from  the  discrete- form  of  the  nonlinearities  of  Scheme  1.  In  order  to  distinguish  this 
scheme  from  Schemes  1  and  2,  the  optimal  quantizer  (breakpoints  and  quantization  levels)  and 
the  associated  test  statistic  of  Scheme  3  are  denoted  by  Qk  and  Tn<k,  respectively,  where 

Tn,k  =  -f2Qk(xlk)).  (7) 

n  i=i 

This  particular  scheme  is  motivated  by  the  need  to  approximate  Scheme  1  by  a  quantization 
method  for  the  usual  reasons  of  reduction  in  the  transmitted  information,  but  it  is  also  supposed 
to  achieve  better  performance  than  that  of  Scheme  2  with  fewer  quantization  levels  (and  thus 
transmitted  bits  of  information),  because  of  the  optimized  break-points  and  levels. 

Finally,  in  Scheme  4  (illustrated  in  Fig.  C.3),  the  sensors  form  the  test  statistics  Tn,k  as  in 
Scheme  1,  then  use  thresholds  to  make  hard  binary  decisions  about  which  hypothesis  (H\  or  H o) 
is  true,  and  then  transmit  these  binary  decisions  to  the  fusion  center.  The  fusion  center  then 
executes  a  likelihood  ratio  test  based  on  the  sensor  decisions  in  order  to  reach  the  final  decision. 
This  scheme  has  the  same  configuration  as  Scheme  4.  However,  the  number  of  quantization 
levels  for  this  scheme  is  two;  thus  we  can  not  use  the  analysis  and  optimal  design  developed  for 
Scheme  4,  which  is  valid  only  if  the  the  number  of  quantization  levels  is  large. 
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As  described  above,  the  function  of  the  fusion  center  is  to  collect  (as  in  Schemes  1  and  4)  or 
form  (as  in  Schemes  2  and  3)  the  test  statistics  from  the  information  transmitted  by  the  sensors. 
In  the  design  of  optimal  nonlinearities  or  quantizers  for  Schemes  1-4,  we  employ  the  Bayesian 
risk  criterion  on  the  error  probabilities  of  the  fusion  center.  We  use  large  deviations  theory  to 
associate  the  Bayes  risk  with  the  final  design  criterion  for  each  nonlinearity  or  quantizer  that 
has  the  form  of  generalized  signal-to-noise  ratio. 

The  course  of  obtaining  the  optimal  nonlinearities  or  quantizers  involves  solving  linear 
coupled  or  uncoupled  integral  equations,  which  depend  on  the  univariate  and  bivariate 
probability  density  functions  (pdfs)  of  the  sensor  noise  sequence.  The  optimal  threshold 
of  each  sensor  q*,  is  determined  by  the  optimization  of  the  individual  design  criterion  for  each 
case;  it  depends  only  on  the  means  and  the  variances  of  the  test  statistics  Tn<k,  under  the  two 
hypotheses,  and  thus  on  the  aforementioned  univariate  and  bivariate  pdfs. 

To  recapitulate,  the  contributions  of  this  paper  are  the  following: 

(i)  it  investigates  asymptotically  optimal  memoryless  schemes  for  quantizing  either  the  sensor 
observations  or  the  sensor  test  statistics,  before  transmitting  to  the  fusion  center,  and  appropri¬ 
ate  fusion  rules  for  multi-sensor/fusion-center  discrimination  problems  with  dependence  in  the 
observations  across-time  and  sensors; 

(ii)  it  extends  the  useful  methodology  of  nonlocal  memoryless  detection  (discrimination)  from 
the  single-sensor  case  to  the  case  of  multiple  sensors  with  a  fusion  center: 

(iii)  it  treats  multi-sensor/fusion-center  discrimination  problems  with  dependence  in  the  obser¬ 
vations  across-time  and  sensors,  thus  extending  the  existing  work  in  this  area  which  deals  with 
i.i.d.  sensor  observations. 

The  remainder  of  this  paper  is  organized  as  follows.  In  Section  II,  preliminary  comments 
apply  to  all  quantization/fusion  schemes  considered  in  this  paper  are  provided.  In  Section  III, 
our  four  quantization/fusion  schemes  are  presented  for  the  case  of  dependence  across  time  only. 
Then  the  same  quantization/fusion  schemes  are  presented  in  Section  IV  for  the  case  of  depen¬ 
dence  across  time  and  sensors;  the  important  special  case  of  sensors  with  identical  univariate 
and  bivariate  probability  densities  is  also  treated  there.  In  Section  V,  numerical  results  for 
the  two-sensor/fusion-center  discrimination  between  lognormal  and  Rayleigh  dependent  obser¬ 
vations  sequences  are  provided.  In  Section  VI,  the  robustification  of  our  schemes  to  statistical 
uncertainty  in  the  sensor  observations,  the  extension  of  our  schemes  to  environments  with  more 


7 


than  two  sensors,  and  the  conclusions  drawn  from  our  work  are  discussed. 

II.  PRELIMINARIES 

Since  a  large  sample  size  is  used,  we  are  interested  in  the  asymptotic  exponential  rate  of 
decrease  of  the  Bayesian  cost  defined  by 


E[C]  =  c0pPo(ln  Ln  >  77)  4-  cx(l  -p)Pi{\nLn  <  p) 


(8) 


where  p  is  the  a  priori  probability  of  Ho  and  c;  (i  =  0, 1)  are  positive  constants.  When  c;  =  1 
(i  —  0, 1),  this  Bayesian  cost  is  the  error  probability  in  the  fusion  center.  The  exponential  rate 
of  this  expected  cost  is  defined  by  —  2-ln  E[C], 

1.  Lower  Bound  on  the  Asymptotic  Rate  of  Error  Probabilities  for  Schemes  1-3 
Here  we  derive  a  lower  bound  on  the  asymptotic  rate  of  E[C ]  (or  upper  bound  on  E[C\), 
which  is  then  used  in  the  design  of  optimal  nonlinearities  or  quantizers  in  Schemes  1-3. 

Since  Tn,i,Tn,2  are  asymptotically  jointly  Gaussian,  the  log-likelihood  ratio  function  of  the 
fusion  center  takes  the  form 


ln  Lri 


n 


2(1  -Pi) 

n 


(En,l  -  Ml,o)2  .  (Ti, 2  -  p 2,o)2  2/?o(7n,l  -  Ml,o)(2n,2  -  M2,o) 

9  T  9 


T,o 


'2,0 


01 ,002,0 


+  ln 


2(1  —  Pi) 
01,002,0 


(Ti, l-^l,l)2  (T„,2  -  M2,l)2  2/>i(Tn4  «  7ixii)(Tni2  A«2,i)! 

919 


'1,1 


2,1 


01,102,1 


(9) 


01,102,1 

which  is  a  quadratic  and  not  a  linear  combination  of  Tnj\  and  and  thus  not  asymptotically 
Gaussian  distributed. 

The  means  of  ln  Ln  under  the  two  hypotheses  are  given  by 


£'o[lnXn]  =  i  +  ln^£_ 


01,0 


02,0 


PoPl  01, 002,0 


01,102,1  2(1  —  p\)cr\}i  2(1  —  p\)o\  x  (1  -  P2)01, 102,1 


n 


2(1  -Pi) 


(Ml,l  -  Ml.o)2  .  (^2,1  -  P2,o)2  2pi(qi,i  -  Pl,o)(p2,l  -  M2,o) 
_2  '  _2 


'1,1 


'2,1 


and 

•Ei  [In  Ln] 


_l  +  lnfl22l£  +  T^!4_  + 


O’- 


2,1 


+ 


01,102,1 


PoPl  01, 102,1 
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+ 


n 


2(1  -Pi) 


01,102,1  2(1  —  Pq)o’i  q  2(1  -  pl)ol  o  (1-^)01,002,0 

(^1,1  —  Pl,o)2  .  (j02,l  -  M2,o)2  2p0(Ml,l  -  hl,o)(h2.1  -  ^2,0) 
+  —2 


'2,0 


G 1, 0^2,0 


m 


moreover,  as  n 


oo 


-E0[\nLr 

n 


2(1  -Pi) 


(p\,l  -  Ml.o)2  .  0*2,1  -  p2fi)2  %Pl(Pl,l  -  fl l,o)(M2,l  ~  /*2,o) 


<7 


+ 


1,1 


£7. 


2,1 


01,1 02,1 


=  VO 

(12) 


and 


n 


£'i  [In  Lr 


1 


2(1  -  pi) 


(Ml,l  —  Ml,o)2  ,  0*2,1  —  p2,o)2 

1"  o 


2po(/*l,l  -  pl,o)(p2,l  -  /*2,o) 


a 


1,0 


'2,0 


O’!, 00-2,0 


Vi- 

(13) 


In  other  words,  P,-[ln  Ln]  (i  =  0, 1)  are  dominated  by  n-<pi,  respectively,  as  n  — >  oo.  To  guarantee 
the  reasonable  condition  that  the  error  probabilities  converge  to  zero,  as  n  — >  oo,  we  assume 


that  the  consistency  condition  described  by 


Vo  <  V  <  Vi 


(14) 


for  i  =  0, 1,  holds  for  the  discrimination  problem  of  interest.  From  the  ergodic  mean  theorem, 
the  condition  (14)  guarantees  that  the  error  probability  in  the  fusion  center  converges  to  zero,  as 
the  sample  size  increases.  A  consistency  condition  similar  to  (14)  was  used  in  the  single-sensor 
environment,  but  with  a  linear  test,  meaning  that  the  log-likelihood  ratio  ln  Ln  was  a  linear 
function  of  the  test  statistics  Tnik  (see  [12]  and  [13]). 

Let  Pmax  —  max{Po(lnLn  >(577),  Pi(ln  Ln  <*#7)}.  Then,  as  n  — ►  00,  we  have 

/>  A 


~ln£[C]  =  —  —  ln 
n  n 


■  ln 


P0(lnLn>hrj)  .  Pi(ln  Ln  <rq)' 
cop - 5 - +  ci(l  -  p) 


Prr 


--In  Prr 
n 


(15) 

To  establish  the  design  criteria  for  the  optimal  sensors  thresholds  and  nonlinearities  or  quan¬ 
tizers,  we  first  apply  the  large  deviation  principle  of  [17]  to  characterize  the  exponential  rates 
of  the  two  types  of  error  probabilities,  and  then  derive  lower  bounds  on  these  rates  (or  up¬ 
per  bounds  on  the  error  probabilities),  which  are  then  used  in  the  design  of  optimal  sensor 
nonlinearities  or  quantizers. 

Now  let  us  give  some  definitions  associated  with  the  large  deviation  principle,  which  are 
relevant  to  our  treatment.  For  s  €  R,  define 


lim  bin(s)=  lim  —  ln  Pt- [exp  (sin  Ln)\ 

n—KX>  ’  n— *•  00  n 

(16) 

Di(bi )  =  [s  E  a2  :  bi(s)  <  00} 

(17) 
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and 


Ii(z)  =  sup {sz  -  bi(s)} 
seR 


(18) 


where  p  ( i  =  0, 1)  are  the  Legendre-Fenchel  transforms  of  Moreover,  under  (i  =  0, 1) 
and  for  a  subset  G  of  R,  define 


Ii(G)  =  inf {I(z)  :z£G} 


(19) 


and  the  induced  probabilities 

Pi(G)  =  Pri{lnLneG}  (20) 

where  I,(G)  are  termed  entropy  functions  in  [17].  It  is  easy  to  show  that  &i,n($i,S2)  (i  =  0, 1) 
are  convex  functions,  for  all  n,  hence  &,-(si,S2)  are  also  convex  functions.  The  following  theorem 
characterizes  the  asymptotic  rates  of  the  two  types  of  error  probabilities  and  is  a  direct  result 
of  Theorem  II. 2  in  [17]. 

Theorem  1:  Under  ( i  —  0,1),  suppose  6,-(s)  exist  for  all  a  E  R  as  closed  functions,  and 
that  Di(bi)  has  non-empty  interior  containing  the  point  s  —  0.  Then,  for  any  closed  subset  F  of 

R, 

lim  inf  --\nPi(F)>  IRF).  (21) 

71 — ►OO  71 

Furthermore,  if  6,-  is  differentiable  on  the  interior  of  Dl  and  steep  in  the  sense  that  the  magnitude 
of  the  derivative  of  6;,  \dbi(s)/ds\  diverges  to  infinity,  as  s  tends  to  the  boundary  of  D{,  then  for 
any  open  subset  E  of  R, 


lim  sup - In  Pi(E)  <  E(E), 

n—+  oo  Tl 


(22) 


In  general,  under  the  true  distribution  of  H\k\  the  exact  form  of  /;  is  difficulty  to  obtain, 
especially  for  the  quadratic  test  characterized  above.  Even  by  assuming  the  joint  Gaussian  dis¬ 
tribution  for  (Tn)i,  Tn,2),  we  cannot  have  useful  criteria  for  optimal  nonlinearities  or  quantizers, 
because  the  supreme  of  sz  —  b{(s)  in  (18)  does  not  take  a  neat  form.  Alternatively,  let  us  consider 
the  local  behavior  of  sz  —  6,-(s)  for  |s|  <  a  with  a  —>■  0.  Let  the  set  Bs  be 


Bs  —  G  R  *  l^l  s',  s  — >■  0} 


which  is  a  subset  of  R.  Then  we  have 


Ii(z )  —  sup{^s  —  61(5)}  >  sup  {sz  —  bi(s)}  =  /,(.-) 
sgR  s£B3 


(23) 


(24) 
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where  /;(z)  (i  =  0, 1)  are  characterized  by  the  following  lemma,  whose  proof,  together  with  the 
proof  of  Theorem  2  that  follows,  are  given  in  Appendix  A. 

Lemma  1:  The  lower  bounds  I{(z)  (i  =  0, 1)  on  the  asymptotic  rates  7,(z)  are  given  by 

h{z)  =  (z  -  (25) 

where  S{  =  arg  supseB  t{sz  -bi(z)}  has  a  sign,  which  makes  J;  positive  for  any  2-  in  the  particular 
set  (S',  and  <p{  are  defined  by  (12),  for  i  =  0,  and  (13),  for  i  =  1. 

Thus,  for  any  set  G,  we  have 

Ii(G)  =  inf  Ii(z)  >  mili(z)  =  Ii{G )  (26) 

-?£Cj  z£0r 

where  Ii(G)  is  defined  in  a  way  similar  to  Ii(G). 

In  other  words,  as  n  —>  oo,  if  —^P{(G)  — *•  Ii(G),  then  Pi(G )  <  exp{— nli(G)}  by  u:sing  (26). 
According  to  this  lemma,  we  have  the  following  theorem,  which  characterizes  the  upper  bounds 
on  the  two  types  of  error  probabilities  of  the  fusion  center  under  a  large  sample  size. 

Theorem  2:  Suppose  the  conditions  in  Theorem  1  for  (21)  and  (22)  are  satisfied.  Moreover, 
suppose  that,  under  Il\k ^  (i  =  0, 1;  k  —  1,2),  Tn ^  are  asymptotically  Gaussian  distributed  with 
mean  ^k,i  and  variance  and  (Tnii,rni2)  asymptotically  jointly  Gaussian  distributed  with  the 
correlation  coefficient  pt-.  Then,  for  In Ln  given  by  (9),  as  n  — ►  oo, 

-  ~  In  P0(\nLn  >nr])  >  (tj  -  <f 0)s;  -- ln  Pj(ln  Ln  <mr?)  >  (pi  -  tj)s  (27) 

n  a  n  a 

where  s  >  0  with  s  — »  0  and  ipi  are  given  by  (12)  and  (13);  consequently,  if  we  define 

I  min  ( 7/ )  =  min  {(77  -  y0)s,  (c^i  -  77)5}  (28) 

then 

-  —  \n.E[C]  >  Imin(v)-  (29) 

n 

Equivalently,  we  have  the  following  upper  bounds  on  the  error  probabilities  and  the  average 
cost 

Po(\n  Ln  >  T])  <  exp{-n(t]  -  (p0)s};  Pi(ln  Ln  <  77)  <  exp{-n(y3i  -  ??)}  (30) 

and 

E[C]  <  exp{—nlmin(ri)}  (31) 
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for  a  large  sample  size  n.  As  n  — *■  oo,  these  upper  bounds  tend  to  be  tight  in  the  sense  that 
both  the  bounds  and  the  error  probabilities,  and  thus  the  average  cost,  converge  to  zero  under 
the  consistency  condition  (fo  <  T]  <  (fii-  Then  the  optimal  threshold  is  obtained  by  maximizing 

I  min  (  7?  ) 

V  =  arg  max  Imin( 77)  =  arg  max  -  1 p0)s ,  (y?i  -  tj)s}.  (32) 

Vo<r><Vl  Vo<n<Vl 

Since  under  the  consistency  condition  <P0<77<<P1  and  for  positive  5,  (rj-<po)s  and  (<pi  -r))s  are 
increasing  and  decreasing  functions  of  77,  respectively,  there  is  a  single  solution  of  the  equation 

(7/  -  (fi0 )5  =  (ipi  -  t])s  (33) 

for  77,  and  it  is  the  optimal  threshold  characterized  by  (32).  By  solving  (33)  we  obtain  the 
optimal  threshold  given  by  77*  =  (<p0  +  <pi)/2;  by  employing  this  optimal  threshold  we  have  the 
final  form  of  the  lower  bound  on  the  asymptotic  rate  of  the  average  cost  given  by 

Wo-)  = 

1  (a*!,!  -  P l,o)2  (p2,l  -  ft2,o)2  _  2/3q(/Ii|1  -  /^1,o)(M2,1  ~  P2,o)  £ 

2(l-p§)[  af  0  cr| 0  o-1i0c72>0  J  2 

.  1  (Pl,l  -  Ml,o)2  ,  (M2,l  -  At2,o)2  _  2pi(^!,l  -  Ml,o)(/72,l  -  p)  £ 

2(1  -/»?)[  ct2,1  ^1, 1^2,1  J  2' 

(34) 

Since  5/2  is  a  positive  constant,  we  may  normalize  the  above  Imin{ V*)  by  dividing  it  by  5/2  to 
obtain  the  design  criteria  for  the  optimal  nonlinearities  or  quantizers  given  by 

Jmin^V 0,  <Pl)  ==  2/T7l,-„(77  )/5  =  <pi  —  <po 

1  (^1,1  —  Pl,o)2  (/P2,l  -  /*2,o)2  _  2po(/ii,i  -  ABpK/^24  ~  P2,p) 

2(1  -  Po)  [  ^1,0  *2,0  ffl, 0^2,0 

I  1  (pi,l  —  Ml,o)2  (^2,1  -  P2,o)"  _  2/?l(/7l,l  ~  ^l,o)(/^2,l  ~  M2,p) 
2(1'—  pj)  [  erf  !  a\x  01,102,1 

(35) 

Thus  the  optimal  nonlinearities  or  quantizers  in  Schemes  1-3  are  characterized  by  the  following 
maximization  problem: 

max  Jm,„(y0,  <pi) 

01  ,92 

subject  to  fik,i  >  Hk, 0;  k  =  1,2.  (36) 
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In  general,  numerical  optimization  techniques,  such  as  nonlinear  programming,  are  required 
for  solving  such  as  maximization  problem,  and  the  two  optimal  nonlinearities  or  quantizers 
are  coupled.  In  the  cases  treated  in  Sections  III  and  IV,  which  are  of  practical  interest,  this 
maximization  problem  can  be  solved  with  the  help  of  analytical  methods. 

2.  Asymptotic  Rate  of  Error  Probabilities  for  Scheme  4 

In  Scheme  4,  each  sensor  transmits  its  decision,  a  binary  random  variable  4,fc,  (k  =  1,2) 
to  the  fusion  center.  This  random  variable  4,fc  is  not  Gaussian  distributed  any  more  (not  even 
asymptotically  Gaussian  as  n  — *•  oo,  and  thus  the  design  criterion  derived  above  is  not  applicable. 
To  derive  a  useful  design  criterion  for  the  optimal  nonlinearities,  we  introduce  here  the  following 
asymptotic  rates: 

-  -lnP0(4,i  =  1,4,2  =  1)  =  --lnPo(rn)jb  >  VuTn,k  >  M2)  (37) 

Tb  71 

-  —  lnPj(dnl  =  0,4,2  =  0)  =  —  —  In P\(Tnk  <  m,Tn,k  <  M2)  (38) 

n  n 

-  —  In  P,(4,i  =  1,4,2  =  0)  =  --In  Pi(Tn,k  >  Mi  ,Tn,k  <  M2)  (39) 

n  n 

and 

-  -  In 4(4,1  =  0,4,2  =  1)  =  --In Pi(Tn<k  <  rji,Tn,k  >  M2)  (40) 

n  n 

for  i  =  0, 1.  In  (37)-(40),  4,fc  =  0,  if  sensor  k  decides  in  favor  of  H0,  and  4,fc  =  1,  if  it  decides 
in  favor  of  H\,  The  lemma  that  follows  characterizes  the  closed-forms  of  the  above  asymptotic 
rates;  it  is  taken  from  Theorem  5  of  [8]  and  proved  there  with  the  help  of  the  large  deviation 
principle. 

Lemma  2:  Suppose 


0-1,0 


and 


0-1,1 


M  0 


>  Pi 


V 2  -  P2,0 

1  > 

m  ~  P 2,0 

v.  Mi  -  Mi.o 
>  Mo 

(41) 

0-2,0 

0-2,0 

o-qo 

M2 ,2  —  M2 

M2,2  -  M2 

^  Mi,i  -  Ml 
>  Mi 

(42) 

0-2,1 

0-2,1 

0-1,1 

ed  statistics  (Tn,i,4 

t,2),  the  asymptotic  rates  defined  by 

(37)-(40)  are  given  by 

1  ,  r,  /.  _n  ,  IN  (M2~M2,o)2  1  1  „  ,  ,  _n  ,  .0*1,1 -*7: 1)2 

111  —  0,  dn^  —  1)  >  O  2  ?  ^  -^1  (^n,l  —  0?  2  1) 

Za2,0 


71 


71 


2^1,1 


(43) 
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1  ,  p  t  j  _  i  ,  _  n\  (r/i  ~  Pi.o)2  1  i  J3  (  i  _  n  ,  _  (P2,i  -  %)2 

*“  In  Jr —  1?  dUl 2  —  0)  *■  O  2  ’  In  (^n,l  —  1  ?  ^n,2  —  0)  * 

71  Z&i  q  Tl 


and 


2^, 


(44) 


In  -fb(^n,l  —  —  1) 


n 


2(1- *3) 


(Pi  -  Pi,o)2  (i?2  -  P2,o)2  2p0(pi  -  p i,o)(f?2  -  P2,o) 

•>  T  o 


<7 


1,0 


'2,0 


01,002,0 


(45) 


--lnPi(dn,i  =  0,d„,i  =  0) 


1 


2(1 -Pi) 

as  n  — *■  oo.  Furthermore, 


(Pi,i-Pi)2  ,  (P24-P2)2  2pi(/ii,i  -  Pi)(p2,i  -  m) 
•  0 


(7 


1,1 


2,1 


<71, 10*2,1 


^  In  3(^71, 1  —  I7  <^n,l  —  1)  ^  ^  In  P{ o(<in,l  —  1?  <^n,l  —  0) 

-  —  In  Po(dn  \  =  1,  dn  1  =  1)  >  —  —  In  Po(dn,i  =  0,dn,i  =  1) 

n  n 

-  -lnP^dn,!  =  0,  dn  \  -  0)  >  --lnPi(dn>1  =  l,dn,i  =  0) 

n  n 

-  —  In  Pi(d„  i  =  0,d„,i  =  0)  >  --lnPi(d„,i  =  0,dn,i  =  1). 

n  n 


(46) 

(47) 

(48) 

(49) 

(50) 


In  the  following  two  sections,  Lemma  2  is  used  to  derive  the  design  criterion  for  the  opti¬ 
mal  sensor  nonlinearities  and  thresholds  of  Scheme  4.  For  notational  convenience,  we  denote 
by  P,-(0, 1),  P,(1,0)  ( i  =  0,1),  P0(l,l),  and  Pi(l,l)  the  probabilities  P;(dn,i  =  0,dnj2  =  1), 
Pi(dn,i  =  1?  dn  2  =  0),  P0(dnA  =  1,  d„j2  =  1),  and  Pi{dn>l  -  0 ,dn<2  =  0),  respectively. 

III.  DEPENDENCE  ACROSS  TIME 

In  this  section,  we  consider  the  case  of  dependence  across  time,  in  which  and 

{.Xf  }g_x  are  mutually  independent.  In  this  case,  p\  =  p0  =  0  and  the  objective  function 
Jmin(f o?<Pi)  characterized  in  Subsection  II. 1  has  the  form 


Jmin{}pOi  *Pl)  — 


1  1 
+ 


2*i,o  2cri,i  j 


(pi,i  -  Pi,o)“  + 


1  1 
+ 


202,0  2tT2,lJ 


(P2,l  -  P2,o)2 


=  ^l(Pl,OjPl,l,01,O!01,l)  +  <lr2(P2,O>Ai2,l)02,O)02,l) 

where  Jk  (k  —  1,2)  are  defined  by 


^(pfc, 0>  Pfc,l’  0fc, Oj  0fc,i)  — 


1  1 
+ 


L2<7M  2ctmJ 


(PM  -  Pfc,o)2- 


(51) 


(52) 
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We  notice  that  Jmtn  given  by  (51)  is  the  sum  of  J\  and  J2,  which  are  functionals  of  their  own 
individual  nonlinearities  or  quantizers  of  the  two  sensors.  Therefore,  the  optimization  problem 
of  J min  characterized  by  (36)  is  decoupled  into  two  optimization  problems  as 


maX  Jki^f^k ,0;  i  &k,0i  &k,l )>  k —  1?  ^ 

9k 

subject  to  /£fctx  >  Hk, 0  (53) 


As  discussed  in  Appendix  B,  this  optimization  problem  can  only  be  solved  via  numerical 
techniques,  which  are  typically  very  computation-intensive  and  make  the  design  procedure  of 
the  optimal  quantizers  (breakpoints  and  levels)  extremely  complex.  To  decrease  the  design 
complexity,  we  introduce  here  the  following  simplified  form,  for  each  Jk, 

(t*k,i  -  Hk,o)2 


Jk('/J‘k,Ch  l^k,\  1  &k,0i  &k,l )  — 


al,o  +  <rltl 


(54) 


which  has  been  used  successfully  in  single-sensor  discrimination  (see  [13]).  According  to  the 
inequality 


1  1 
+ 


>k,  0 


(7 


M 


> 


°jo  +  Tt.i  _ 


2°{o  2al,x  2aloah  (alo  +  ah)2/2  alo  +  ah 

we  obtain 

Jk  >  2Jk',  k  =  1,2, 

with  the  equality  satisfied,  if  and  only  if  x  =  <t|0.  This  implies  that 

E[C]  <  exp  {-nlmin}  =  exp{-nJmins/2}  =  exp{-n(Jj  +  J2)s/2] 
<  exp{-n(./i  +  </2)s} 


(55) 


(56) 


for  all  n;  the  first  inequality  becomes  tight  as  n  — ►  00  and  the  second  as  <r|  x  —»  a2kQ.  Thus  the 
simplified  measures  Jk  are  also  involved  in  an  upper  bound  on  the  average  cost  and  the  upper 
bound  characterized  by  them  becomes  tight  as  c x  —  akQ  — >  0. 

We  notice  that  Jk  has  the  form  of  the  generalized  signal- to-noise  ratio  of  [16];  for  the  case 
of  detection  of  a  weak  signal  in  additive  noise,  a kl  =  a\Q  and  Jk  —  2Jk  takes  the  form  of  the 
deflection  criterion. 

In  the  following,  Jk  is  employed  as  the  design  criterion  for  the  optimal  nonlinearity  or  quan¬ 
tizer  of  Schemes  1-3  described  in  Section  I,  and  this  optimization  problem  is  characterized  by 


IXiaX  J 7  7  &k,  1 ) 

9k 

subject  to  Hk,  1  >  IJ-k, 0;  k  =  1,2. 


(57) 
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1.  Fusion  of  Unquantized  Sensor  Test  Statistics 

We  first  consider  the  scheme  which  has  a  configuration  described  in  Figure  1.  The  sensor  test 
statistics  are  fused  without  previous  quantization.  In  this  case,  the  maximization  characterized 
by  (57)  and  (54)  was  solved  for  single-sensor  systems  and  continuous  nonlinearities  in  [13].  Each 
optimal  nonlinearity  is  obtained  by  solving  the  linear  integral  equation  described  by  (see  [13]) 


( fk,i(x )  -  fk,o(x))/(fk,i(x)  +  fk, o(z))  -  J  Kk{x,y)gk(y)dy  =  gk(x);  k=  1,2  (58) 


where 


m  ™k,  o 

Kk(x,  y)  =  {2  J2  Uk}(x’  y )  -  fk,i(x)fk,i(y)\  +  2  Y,  y)  -  fk,o(x)fki0(y)\ 

j=\  j- i 

- fk,i(x)fk,i(y )  ~  fk,o(x)fk,o(y)}/{fk,i(x)  +  fk, oO)} 


(59) 


is  the  kernel  of  integration.  For  the  case  of  m-dependent  noise,  the  above  kernel  is  well  defined; 
for  the  p-mixing  case  we  should  take  mk^  — ►  oo  but  we  also  need  to  assume  that  the  kernel  in 
(59)  exists,  so  as  to  be  able  to  interchange  summation  and  integration  in  (58). 

2.  Fusion  of  Suboptimally  Quantized  Sensor  Observations 

Here  we  consider  Scheme  2,  in  which  the  nonlinearity  used  to  quantize  the  sensor  observa¬ 
tions  before  transmitting  them  to  the  fusion  center  is  obtained  by  discretizing  the  continuous 
nonlinearity  gk  ( k  =  1,2)  of  Scheme  1.  In  the  discretized  form  of  the  integral  equation  given  by 
(58),  let  the  integration  range  be  (X^n,xjnL),  which  is  assigned  according  to  the  support  of 
the  observation  processes.  Then 


-T,Kk(x\k\xf))gk(xf))Axf)  =  gk(x\k));  k  —  1,2  (60) 

fk,i(xn  +  fkAxr)  U 

\k\i  —  0, 1, 2,  •  •  •,  Mk  are  the  quantizer  breakpoints  with  x^  =  X^]n  and  =  Xmlx 

(XikL  -  X{kl)/Mk),  for 


where  x 

and  Ax^  is  the  discrete  approximation  to  dx  (for  example,  Ax 
sensor  k  ( k  =  1, 2). 

Define  the  vectors 


(k) 

j 


fk,i  =  [/jt,t'(Zofc))>  ;  *  =  M 


(61) 


and  the  matrix 


Gk  =  [G^] 


(62) 
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where 


<$’  =  (/»,,(*!*’)  -  M*tbtf**P,*P)**f>  +  )i 

with  8(x,y)  =  1,  if  x  =  y,  and  0,  otherwise.  Then  the  equation  (60)  can  be  written  as 


where  gu  is  defined  by 


(fk,i  -  A,o)T  =  A;  =1,2 


9k  =  [gk(x{0k)),gk(x[k)),  •  •  -,gk(x$k) 


To  solve  (64)  for  gk,  we  assume  that  xt-  (k  —  1,2;  i  —  0, 1,  •  •  • ,  Mk)  are  chosen  for  each  sensor 
so  that  the  matrix  Gk  is  nonsingular.  Thus  we  obtain 


gj  =  Gk_~\fki  -  fk, of;  k  =  1,2 


Then,  for  any  observation  x,  we  can  characterize  the  quantizer  of  sensor  k  as  follows: 

[gk(x{0k)]  +  gk{x[k))\/2  if  x  <  x[k) 

Sk(x)  =  <  [gk(x\k))  +  gk(x{ik+\  )}/2  if  x  g  (x\k) ,  xjjj  (67) 

k  i)  +  Qkix^l) ]/2  if  x  >  x^_: 

for  i  =  1,2,  •  •  -  ,Mjt  -  2. 

Note  that,  in  the  above  quantization  scheme,  the  number  of  levels  Mk  and  the  breakpoints 
x\k\  for  i  =  0, 1,  •  •  -,Mk,  need  not  be  the  same  for  the  different  sensors.  Moreover,  the  spacing 
of  the  breakpoints  in  the  interval  of  the  support  of  the  noise  process  of  sensor  k  need  not 
be  uniform.  For  the  purpose  of  simplicity  in  analysis  and  implementation,  we  may  set  the 
number  of  quantization  levels  to  be  the  same  for  all  sensors  and  the  breakpoints  uniformly 
spaced  over  the  interval  of  support.  Although  the  resulting  quantization  scheme  will  not  be 
optimal,  its  performance,  as  quantified  from  simulations,  is  acceptable  for  a  reasonable  number 
of  quantization  -levels. 

3.  Fusion  of  Optimally  Quantized  Sensor  Observations 

In  Scheme  2  of  the  previous  subsection,  the  quantizer  gk  of  sensor  k  is  obtained  directly  from 
the  discrete-form  of  the  optimal  nonlinearity  gk  of  the  Scheme  1  and  thus  is  not  an  optimal 
quantizer.  Since  quantizers  also  function  as  nonlinearities  [with  their  sum  in  (7)  satisfying  the 
central  limit  theorem]  we  can  recompute  the  generalized  signal-to-noise  ratio  J *  of  (54)  for  sensor 
k  by  using  quantizers  and  the  test  statistics  of  (7);  then  we  can  use  it  as  the  design  criterion 
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for  the  optimal  quantizer  (breakpoints  and  levels)  of  each  sensor.  This  type  of  scheme  has  been 
addressed  in  [14]  and  [15]  for  single-sensor  detection  of  weak  signals  and  a  different  performance 
measure  (the  efficacy).  The  technique  of  [15]  can  be  extended  from  the  single-sensor  weak-signal 
case  to  the  single-sensor  discrimination  case  treated  in  [13]  and  then  to  the  multi-sensor  case 
treated  below. 

Let  Qk(Xi^)  (/  =  1,2, •••,»)  be  the  quantizer  output,  for  sensor  k  (k  =  1,2),  when  Mk 
quantization  levels  are  employed.  Denote  by  4  =  [4,o  4, l  -  4,mJ  its  breakpoints  and  by 
Uk  —  [v.kt i  uic,2  ■  •■uiCtMk]  its  levels.  Then  the  performance  measure  J, t  of  sensor  k  (k  =  1,2)  in 
(54)  can  be  formulated  as  a  function  of  Uk_  and  tk_  by  considering  the  quantizer  Qk  as  a  special 
nonlinearity. 

The  variances  ;/n  of  f  ]FJL  l  Qk(^j^)  under  II have  the  form 


Mk  rh  i  mk’i  Mk  Mk  rtk  r  rtk,i  , 

=  Y2(uk,l)2  ’  fk{x)dx  + 

1=1  Jtk,!-!  j— 1  r=l  ;=i  ■'tk,r- 1  *fc,I — 1 


x, y)dxdy 


-(2  mk,i  +  1) 


ftk’<  r 

2_,uk,i  /  fk(x)dx 


while  the  corresponding  means  have  the  form 


r  ftk •' 

P-k,i  =  /  Qk{x)fk,i{*)dx  —  /  t  Uk,l  I  fk,i 
J  ,=1  Jtk,l- 1 


(x)dx. 


Thus  the  maximization  of  (57)  will  now  be  with  respect  to  Uk_  and  4- 

For  /  =  1, 2,  •  •  • ,  Mk,  k  =  1,2,  and  under  II ^  ( i  —  0, 1),  define  the  vector 


(68) 


(69) 


A/*,; 


[A/'^A/f^ 


■  A 
>  ^JMk 


(70) 


where 


the  matrixes 


and 


Rk,i  =  [R 


(M 

r,/ 


(71) 


(72) 

(73) 


(74) 
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where,  for  r,  l  =  1, 2,  •  •  • ,  M*,, 


r  r  $<., 

j—l  Jtk,r- 1  Jtk,l- 1 


y)dxdy 


and 


=  (2 mkti  +  1)  /  fk,i{x)dx  ■  f  fk,i(x)dx. 
Then  Jk  (k  =  1,2)  assumes  the  form 

[uk(Afk,i  -  A/jt,0)r]2 


Jk(Qk)  = 


(75) 


(76) 


(77) 


In  this  scheme,  we  assume  that  tk,i  (/  =  0, 1,  •  •  ■ ,  M*)  of  sensor  k  are  chosen,  such  that  the  matrix 
+  Rk,i  —  Rk,i )  is  positive  definite.  Then  the  optimal  quantization  levels  of  sensor  k  for 


fixed  breakpoints  are 


T 

Uk  = 


-i-i 


+  Pk,i  -  Rk,j) 


Li=0 


(A/w  -  Afk,o)J 


(78) 


Upon  substitution  for  the  optimal  levels  given  by  (78)  into  Jk,  we  obtain 


Jk  =  (A/fc,  i  —  Afkfl) 


i  -i 


y~x^c,i + Pk£  -  Rk,i) 


U=o 


(A/fc,i  -  Afkp)1 


(79) 


which  is  now  a  function  of  the  breakpoints  tk_  only.  By  using  numerical  optimization  techniques 
(such  as  the  gradient  method)  for  the  objective  function  Jk  given  by  (79),  the  optimal  breakpoints 
can  be  obtained;  then  the  optimal  quantization  levels  are  determined  by  (78). 

4.  Fusion  of  Binary  (Hard)  Sensor  Decisions 

In  Scheme  4,  the  log-likelihood  ratio  test  at  the  fusion  center  takes  the  form  (see  [2]) 


P 1  (dn,  1  >  dn,2) 


P\i,dn,k) 


in l„  =  iny;7':'"'^  =  i>  ,,  , 

-Po(«n,l)  “n,2)  k=1  Po(nn,A:) 

>Bl 


t[wi(4,i)  +  w2(dn^)]  <h0  mj  =  0 


where 


^k^^Tlyk) 


llni^iL  = 


Wk,i 


if  dn,k  —  I 


(80) 


(81) 


n  ak 

-  n  ^  =  ~wk,0  ^  dn,k  ~  0 

with  ak  and  (3k  being  the  probabilities  of  false  alarm  and  miss  for  sensor  k,  respectively,  and 
nrj  =  0  the  threshold  of  the  fusion  center,  which  corresponds  to  equiprobable  hypotheses  H\ 
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and  H0  (prior  probabilities  p  —  1/2).  Thus  the  LRT  fusion  rule  is  characterized  by  what  is  a 
weighted  sum  of  the  decisions  of  the  sensors  (compare  with  [2]  and  [9]). 

Since,  in  all  cases  of  practical  interest,  +  (5k  <  1  under  a  large  sample  size  n,  we  have 
that  Wk,i  >  0  {k  =  1,2,  *  =  0,1)  for  the  Wk/s  defined  by  (81).  Furthermore,  if  we  denote  by 
rjk  ( k  =  1,2)  the  threshold  of  sensor  k  and  assume  that  it  satisfies  the  consistency  condition 
Pk, o  <  Pk  <  Pk,o,  under  which  the  error  probability  of  each  sensor  tends  to  zero  as  n  — ►  oo 
according  to  the  ergodic  mean  theorem.  Actually,  we  may  use  the  large  deviations  principle  of 
[17]  (which  we  applied  in  [8]  in  the  context  of  multi-sensor  detection  systems  without  a  fusion 
center  and  in  [9]  for  multi-sensor  systems  with  a  fusion  center  for  the  detection  of  weak  signals) 
to  obtain 


and 


Wk,  1 


—  In  - — — 

n  ak 

--In a*  -*■ 

n 


=  -[ln(l  -  (5k)  -In  a*] 

n 

(Vk  -  Pk,o)2 
o 


(82) 


1  1 

Wk,  o  = - m- 

n  (5k 

-> - In  — * 

n 


=  i[ln(l  -  ajt)  -  In  j3k] 
n 

</‘M  -  Vk? 


2<T 


for  n  —*  oo.  Thus  the  optimal  decision  rule  in  the  fusion  center  is  given  by 


(83) 


if  dnji  =  dni 2  =  0,  decide  H o 

if  dUi i  =  0,  dn, 2  =  1,  —w\jo  +  w2, i  <h0  0 

>Hi 

if  dn,  1  =  1,  dn, 2  =  o,  tUi.1  -  W2,0  <H0  0 
if  dn>i  =  d„t2  =  1,  decide  H 


(84) 


This  rule  is  similar  to  the  one  derived  in  Subsection  III. 5  of  [9],  except  that  the  weights  given 
by  (82)  -  (83)  are  different  from  the  corresponding  weights  of  [9]. 

For  the  decision  policy  given  by  (84)  and  under  the  consistency  condition  for  each  sensor,  the 
probabilities  Po(0, 1),  Po(l,0),  P0(l,  1),  Pi(0, 1),  Pi(l,0),  Px( 0,0)  of  the  sensor  decision  pairs 
(0, 1),  (1,0),  (1, 1),  and  (0,0)  converge  to  0,  as  n  increases,  and  so  does  the  error  probability  of 
the  fusion  center  Pe.  Thus  it  is  meaningful  to  consider  the  asymptotic  exponential  rate  of  the 
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error  probability  for  large  n,  —  ^  In  Pe,  as  the  design  criterion  for  the  optimal  sensor  nonlinearities 
gk  and  thresholds  77* .  The  error  probability  Pe  has  the  form 


Pe  -  pPo{w\(dn,i)  +  w2(dnp)  >  0}  +  (1  -  p)Pi{wi(dnA)  +  w2{dnt 2)  <  0}  (85) 

where  p  is  the  a  priori  probability  of  Hq  (p  =  1/2  in  our  case).  The  optimal  fusion  test 
characterized  by  (84)  results  in  the  coupling  of  the  two  optimal  thresholds  of  the  sensors  when 
they  disagree  (see  [9]).  As  discussed  in  [9],  to  avoid  the  complex  optimization  problem  that 
can  be  solved  only  via  numerical  techniques,  we  consider  a  simpler  suboptimal  approach  which 
is  shown  via  simulation  results  to  provide  satisfactory  performance.  According  to  this,  we  try 
to  make  +  w2p  and  —  -  w2i 0  respectively  as  large  and  as  small  as  possible,  since  this 

will  maximize  their  distance  on  opposite  sides  of  the  fusion  center  threshold  77  =  0  and  thus 
minimize  the  fusion  center  error  probability.  Specifically,  we  determine  the  sensor  thresholds  by 
maximizing  the  minimum  of  the  weights  given  by  (82)  -  (82),  that  is, 


arg 

max 

minfwfc  .1 . 

Hk,0<Vk<P-k,l 

.  I 

r  1 

arg 

max 

mm  { 

1 

^k,0<Vk<Hk,l 

1 

l  n 
(  / 

arg 

max 

min  \ 

\  {Vk 

Vk,0  <nk<V-k,l 

t 

Vk,0Pk,\  +  &k,lPk,Q 

2erk,o 


2crfc,i 


&k,0  +  <Tk,  1 

Employing  the  above  thresholds,  we  obtain 


(Pk,  1  Pk,  0) 

v>k,  1  =  Wfc.O  =  ™k  =  — - ; - -w; 

2(crfci0  +  crk,\Y 


k  =  1,2 


(86) 


(87) 


Next  we  evaluate  the  asymptotic  rate  —  MnPe  by  applying  Lemma  2.  In  the  case  treated 
in  this  section  p0  =  p1  =  0,  and  thus  conditions  (41)  and  (42)  are  satisfied,  which  enables  the 
application  of  Lemma  2.  We  thus  have  the  following  expressions  for  the  error  probabilities  a 
and  (3  of  the  fusion  center  under  the  two  hypotheses  (see  also  [9]) 


a  =  I(wi  +  w2  >  0)Po(l,  1)  +  I{w\  +  w2  >  0)Po(0, 1) 

+I(w  1  -w2>  0)Po(l,  0)  +  I(wx  -w2>  0)Po(0, 0) 

=  P0(l,  l)  +  /(uq  +  w2  >  0)Po(0, 1)  +  I{w\  -  w2  >  0)Po(l,0)  (88) 
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and 


0  =  I(wl  +  102  <  0)Pi(l,  1)  +  I(wi  +  Wi  <  0)Pi(0, 1) 

+I(w  1  -  w2<  0)Pi(l,0)  +  I(w  1  -  w2<  0)Pi(0,0) 

=  J(wi  +w2<  0)Px(0, 1)  +  I(wl  -w2<  0)Pi(l,  0)  +  Px(0, 0)  (89) 

where  1(A)  for  a  set  A  denotes  the  indicator  function  of  A.  To  minimize  the  above  error 
probabilities  we  must  consider  the  following  different  cases: 

Case  ( i )  :  W\  >  w2  or 

(/P,i  ~  Mi,o)2  >  (f-2,1  ~  M2,o)2 

(^1,0  +  o-gi)2  ~  (cr2,o  +  cr2,i)2' 

Then  W\ti  —  w2io  >  0  —  rj  and  -tux,o  +  w2j  <  0  =  rj,  which  implies  that 

a  =  Pq(1,  1)  +  Pq(1,  0)  and  0  =  Px(0, 1)  +  Px(0, 0)  (91) 


and  the  error  probability  of  the  fusion  center  has  the  form 


Pe  =  p[P0(l,  1)  +  P0(l,  0)]  +  (1  -  p)[Pi(0, 1)]  +  Pi(0, 0)].  (92) 


Intuitively  the  condition  wi  >  w2  corresponds  to  the  decision  of  sensor  1  being  more  reliable 
than  that  of  sensor  2,  and  thus,  in  case  of  disagreement  between  the  two  sensors,  the  fusion 
center  decides  according  to  the  decision  of  sensor  1;  this  is  reflected  in  the  forms  of  the  error 
probabilities  derived  above. 

Let  Pmax  be  the  maximum  of  the  various  error  probability  terms  in  Pe  given  by  (92),  that 
is, 

Pmax  =  max{P0(l,  1),  P0(l,  0),  Pi( 0, 1),  Px(0,  0)}.  (93) 


This  Pmax  dominates  the  asymptotic  rate  of  Pe.  Using  the  results  of  Lemma  2  we  obtain 


--lnPe 

n 


- In  Pmax  =  min 

n 

(/^i.i  ~  Mi,o)2 

(<?i,o  +  (?i,i)2 


(Vi-Ai,o)2  (Vi,i-Vi? 


2<7?, 


where 


Pi  =  arg  max  min 


(m  -  mi.o)2  (/*  1,1  -  m? 


2  a 


1,0 


2<i 


This  is  in  total  agreement  with  the  results  in  (85)  and  (86). 


(94) 


22 


The  maximization  of  the  performance  measure  resulting  from  the  above  asymptotic  rate  was 
discussed  in  [12]-[13]  in  the  context  of  single-sensor  discrimination  and  requires  the  solution  of  a 
nonlinear  integral  equation  with  respect  to  the  nonlinearity  gx .  This  nonlinear  integral  equation 
was  shown  in  [12]  to  have  a  unique  solution  and  can  be  efficiently  solved  as  indicated  in  [13]. 
Still  it  is  difficult  to  robustify  to  uncertainty  in  the  statistics  of  the  observations.  An  alternative 
approach  is  to  use  the  inequality  (x  +  y )2  <  2(x2  4-  y 2)  (which  becomes  an  equality  for  x  =  y) 
and  obtain  the  following  lower  bound  on  the  asymptotic  rate: 

(/T,  1  ~  M l,p)2  (fl  1,1  -  ££l, o)2  /nr:'} 

Ko  +  ^,02  -2(<t20  +  <721)-  1  J 

This  lower  bound  becomes  tight  as  a\  x  — ►  cr20.  The  maximization  of  this  lower  bound  with 

respect  to  gx  has  been  addressed  in  Scheme  1  of  Subsection  III.  1  and  results  in  the  solution  of  a 

linear  integral  equation.  When  w\  >  w2,  we  may  use  the  above  lower  bound  in  the  design  of  the 

optimal  nonlinearity  gx.  Since  we  would  like  the  inequality  wx  >  w 2  to  hold  for  our  choice  of 

gx  as  the  nonlinearity  maximizing  (95),  it  suffices  that  the  desirable  inequality  (wx  >  w2)  holds 

for  the  52  that  maximizes  (^2,1  -  M2,o)2/(cr2,o  +  cr2,i)2  (i.e.,  the  weight  w2);  this  g 2  can  also  be 

obtained  from  the  analysis  of  Subsection  III.l. 

Case  ( ii )  :  wx  <  w 2  or 

(Mu  _  Mi,o)2  <  (m 2,1  -  ^2,o)2 


(  ^  \2  —  /  ,  ,2.  (96) 

( 0-1,0  +  Vl,ir  (0-2,0  +  0-2,1  y 

Then  wx  —  <  0  and  —W\  +  io2  >  0.  This  case  is  dual  to  case  (i).  The  formulation  and 

necessary  steps  are  similar  to  those  of  case  (i);  we  only  need  exchange  the  roles  of  sensors  1  and 
2  to  obtain  the  appropriate  optimization  conditions  for  sensor  nonlinearities  and  thresholds. 
Therefore,  we  do  not  repeat  the  details  here. 

In  summary,  for  Scheme  4  and  under  equiprobable  hypotheses  the  optimal  thresholds  and 
nonlinearities  of  sensors  are  determined  from  (86)  and  (58)-(59),  respectively. 

IV.  DEPENDENCE  ACROSS  TIME  AND  SENSORS 

The  case  of  interest  here  is  that  of  identical  sensor  univariate  and  bivariate  densities,  i.e., 

h,i(x)  -  hAx )  =  Mx)  and  fij(x^y)  -  f2,hx,y)  =  fiJ\x,y), for  *  =  °,  i  and  j  = 

where  we  assume  that  miit-  =  m2}i  =  mi  for  the  range  of  dependence  of  the  sensor  observations. 
In  addition,  let  f^\x,y)  —  fi( X =  x,Xj+x  —  y ),  for  j  —  0, 1, 2,  •  •  -,  fhi,  denote  the  joint 
densities  of  the  two  observation  processes,  where  rhi  is  the  range  of  dependence  of  the  observa¬ 
tions  across  the  two  sensors  under  hypothesis  Hi  (i  =  0, 1).  It  is  easy  to  verify  that  the  choice 
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9i  —92  —9  for  the  sensor  nonlinearities  satisfies  the  necessary  conditions  for  the  maximization 
problem  given  by  (36).  Although  we  cannot  argue  that  this  is  the  optimal  solution,  it  is  a 
reasonable  choice,  as  it  simplifies  the  implementation  and  provides  a  satisfactory  performance 
(as  we  establish  via  simulation).  The  optimal  solution  is  very  difficult  to  obtain  via  analytical 
techniques. 

Thus  =  ji2,i  =  Pi  and  oqit  =  <72,;  =  o-,-,  under  H f'1  (i  —  0, 1),  and  Jmin  takes  the  form 


Jmin(<p 0,<Pl)  =  (mi  -  Hof 


+ 


(1  +  Po)<7q  (1+PlWJ’ 


(97; 


Similar  to  the  maximization  problem  characterized  by  (53),  the  maximization  of  the  above  Jmt-„ 
under  pi  ^  po  requires  numerical  techniques  and  is  not  amenable  to  further  analysis.  Hence, 
we  turn  to  a  lower  bound  on  Jmin  similar  to  the  one  used  in  Section  III.  In  particular,  we  apply 
a  modified  version  of  the  inequality  in  (55)  to  deduce  the  following  lower  bound  on  Jmtn 


Jmin  4.7 


where 


J(fxuai,pi;i  =  0,1)  = 


(Hi  ~  Hof 


(1  +  Po)°o  +  (1  +  Pi)cr  1 

with  the  equality  satisfied,  if  and  only  if  and  p\  =  po-  This  implies  that 


(98) 


E[C]  <  exp  {-nlmin}  =  exp{-nJmins/2} 

<  exp{—n(2J)s} 

for  all  n ;  the  first  inequality  becomes  tight  as  n  — ►  oo  and  the  second  as  a\  —>  and  p\  — >  po- 
Then  the  optimal  g  is  obtained  by  solving  the  maximization  problem  described  by 


max  J (pi,  up  pi ;  i  =  0, 1) 

9 

subject  to  pi  >  po.  '  (99) 


This  maximization  problem  is  solved  in  Appendix  C,  and  the  sufficient  and  the  necessary  con¬ 
ditions  for  the  optimal  g  are  derived  there. 

We  notice  that  both  (97)  and  (98)  define  new  expressions  for  the  signal-to-noise  ratio  of 
two-sensor  discrimination  systems.  These  performance  measures  are  reminiscent  of  (but  dis¬ 
tinctly  different  from)  the  corresponding  efficacy-type  measures  derived  in  [9]  for  the  two-sensor 
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detection  of  weak  signals.  For  the  rest  of  this  section,  we  use  ./  as  the  design  criterion  for  the 
nonlinearity  or  the  quantizer  in  the  various  quantization  and  fusion  schemes  except  for  Scheme 
4. 

1.  Fusion  of  Unquantized  Sensor  Test  Statistics 

From  the  maximization  of  Jmt-n  performed  in  Appendix  C  we  obtain  that  the  optimal  non¬ 
linearity  satisfies 

(fi(x)  -  fo(x))/(fi(x)  +  /o(x))  -  J  Kc(x,  y)g{y)dy  =  g(x)  (100) 

where  the  integration  kernel  takes  the  form 

771 J  771 J 

Kc(x,y)  =  {2  Y,[f{:\x,y)  -  fi(x)fi(y)}  +  2^[/1(j)(r,2/)  -  /i(r)/i(?/)] 
j- i  f= i 

mo  mo 

+2  53/ oj)(x,y)  -  fo(x)fo(y)\  +  2j2[fo\x,y)  -  fo(x)fo(y)} 

j= i  i=i 

y)  -  2fl(x)f1(y)  +  ff\x,  y )  -  2f0(x)f0(y)}/{fi{x)  +  f0(x)}.  (101) 


This  is  similar  to  the  derivation  of  g k  for  Scheme  1  in  Subsection  III.  1 . 

2.  Fusion  of  Suboptimally  Quantized  Sensor  Observations 

Similar  to  Scheme  2  in  Subsection  III. 2,  we  now  obtain  the  discretized  form  of  (100)  as 


fi(xj)  -  fo(xj) 

fi(xi)  +  fo(xi) 


M 

—  )  A  c(.X\i  Xj)g(xj) /\xj  —  g(xjj 

j—0 


(102) 


where  a;,-,  for  i  =  0, 1, 2,  • • •, M  with  M  =  M\  =  M2,  are  the  breakpoints.  We  define  the  vectors 


h  =  [fi(x0),  fi(xi),  ■  ■  ■ ,  fi(xM)\ ;  i  =  0,1 

and  the  matrix 

<2  =  [cy  . 

where 

Gij  —  (/i(^'!')  fo(xi^)[h  (x{7  x jj j  -j-  <5(aq,rj)]. 
Then  (102)  can  be  written  as 

(f1-fg)T  =  GgT 


(103) 


(104) 


(105) 


(106) 
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where  g  is  defined  in  the  same  way  as  gg  in  Subsection  III.2.  We  again  assume  that  x,-  (i  = 
0, 1,  •  •  -,M)  are  chosen  such  that  the  matrix  G  is  nonsingular.  Thus  we  have 

9T  =  Q-\h-  /of  (107) 


and.  the  quantizer  for  this  scheme  is  characterized  by 


9(x)  =  \ 


(108) 


b(*o)  +  gk(xi)}/2  if  x  <  Xi 

[g(xi)  +  g(xi+i)}/2  if  x  6  (x,-,  xt+1] 

[g(xM- 1)  +  g{xM)]/ 2  if  x  >  xm-i 

for  i  —  1,2,  •  •  •  ,M  -  2. 

3.  Fusion  of  Optimally  Quantized  Sensor  Observations 

Similar  to  Scheme  3  of  subsection  III.3  but  using  the  same  structure  for  both  quantizers,  in 
other  words,  Qi  —  Q2  =  Q ,  tg  =  £2  =  t  and  ui  =  ug  =  u,  we  obtain  under  (i  =  0, 1) 

m, 

Pi&i  =  £;[Q(Xi(1))Q(x{2))]  -  (Et[Q(xil))}  f  +  2  ^{£i[Q(X1(1))Q(xj2)1)]  -  (^(X^)])2} 

i= 1 

mi  M  M 


M  M 


UTUl 


=  u‘  I  j  fl°](x,y)dxdy  +  2^{]Tjr 

r=l  ;=1  “^i-l  j=l  r=l  i=l 

W  P, 

-(2m,- +  l)[]Tu;  /  fi{x)dx]2 
1=1 

where  p,-  is  the  correlation  coefficient  Tn<i  and  Tn^-  In  addition, 


rtr 


'tr- 1 


/^(x^dxd?/} 


(109) 


r  M  />t| 

Ai  =  Q{x)fi(x)dx  =  fi(x)dx.  (110) 

a  •'U-1 

Let  the  vectors  A /&,,•  and  the  matrixes  P^,-  and  Pfc,;  be  defined  in  the  same  way  as 
in  Scheme  3  of  Subsection  III.3.  Then  under  symmetric  densities  and  identical  quantizers, 

A/m  =  a f2,i  =  a/,-,  Fig  =  In  =  =  In  ~  II  and  In  =  In  -  =  °>  !)• 

Moreover,  under  hI  ( i  =  0, 1),  define  the  matrixes  PCi;  =  [pjd;]  and  Pc>;  =  J  where 


PS,l=t  Jl  f-°\x,y)dxdy +2^2  J  J  fj 

Jt r-l  Jtl- 1  J  =  1 


mi  pr  pi  -(i) 


(x,y)dxdy. 


and 


Rcli  =  (2 rhi  +  1)  •  /  fi(x)dx  ■  f  fi(x)da 

’  ’  Pr_l  Pi-1 


(Ill) 


(112) 
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Then  the  objective  function  J, 
Jmini^Q  )  — 


nin  of  quantizer  Q  is  given  by 

_ -  A/0)r]2 _ 

«Ei=o(£  +  P  +  Pc,i  -  R±  -  Rc,i)W 


(113) 


We  also  assume  that  t  are  such  that  the  matrix  Yl}-o(H  +  Pi  +  Pc,i  ~  Rj 
definite.  Then  the  optimal  quantization  levels  of  sensor  k  are  given  by 


u  — 


y  (  p i  +  + pc,i  —  r± — Re*) 


(AA  -  AfoY 


L{=0 


-  Rc,i)  is  positive 


(114) 


for  fixed  breakpoints.  Upon  substitution  for  the  above  optimal  levels  into  Jm,„,  we  obtain  Jmtn 
as  a  function  of  the  breakpoints  t  as  follows: 


J min  —  (A  A  A/q) 


+  P±  +  Pc,i  —  R±  —  Rc,i) 


(AA  -  A/o) 


T 


(115) 


Li=0  J 

The  maximization  of  this  expression  with  respect  to  the  breakpoints  can  be  accomplished  via 
numerical  optimization  techniques. 

4.  Fusion  of  Binary  (Hard)  Sensor  Decisions 

Unlike  Scheme  4  of  Subsection  III.4  and  due  to  the  correlation  between  the  sensor  decisions 
dn< i  and  dn> 2,  the  log-likelihood  ratio  function  of  the  fusion  center  does  not  assume  an  additive 
form.  The  asymptotic  log-likelihood  ratio  is  given  by 


-ln£„  =  —  In  P\(dn  1, dn  2)  -  -In  PQ(dn<1,dn>2)  (116) 

n  n  n 

for  any  sample  size  n ,  and  the  corresponding  log-likelihood  ratio  test  with  the  threshold  rj  =  0 
(for  equiprobable  hypotheses)  at  the  fusion  center  is  given  by 

1  1  >H] 

In Pi(dn  i,dn2)  1ft -fb(^n,i ? ^71,2)  Ho  0  (117) 

n  n 


which  involves  the  asymptotic  rates  of  the  probabilities  P;(0,1),  P,(1,0),  Po(l,l),  and  Pi (0,0) 
of  the  sensor  decisions. 

Suppose  that  the  conditions  (41)  and  (42)  are  satisfied.  Then  we  can  apply  the  results  of 
Lemma  2  to  our  case  and  obtain  that,  for  a  large  sample  size  n,  the  optimal  fusion  rule  is 
described  by 

if  dn  1  -  0,  dn  2  =  1;  -In  Pi(0, 1)  -  -  In  Po(0, 1) 

n  n 


27 


(a*i,i  -  Mi)2  ,  (m  -  M2,o)2  >Hl  n 
+  - — -  <H0  o 


2crl,l 


2ct2,0 


^  ^n,l  —  l?^n,2  — 


if  ^n,l  —  0?  ^n,2  —  0? 


if  dUy\  —  l?^n,2  —  1? 


-lnP1(l,0)--lnPo(l,0) 
n  n 

(M2,i  -  M2)2  ,  (Mi  -  Mi,o)2  >Hl  n 
+  - — -  <H0  0 


20-2,1 


20-1,0 


—  In  Pi(0, 0)  —  —  In  Po(0, 0) 
n  n 


1 


2(1  -Pi) 


(Mu  -  Mi)2  (a*2,i  -  M2)2  2/>i(a»i,i  -  r?i)(/i2,i  -  M2) 

0*0 


'1,1 


'2,1 


Crl,l  d"  2,1 


:ii8) 


> 1  „ 
<H0  0 


“InPiCl,  1)  —  —  lnPo(l,  1) 
n  n 


2(1  -Pi) 


(mi  -  mi,o)2  ,  (m  -  /i2,o)2  '2p0(m  -  mi,o)(m2  -  a*2,o) 

O  T 


1,0 


cr. 


2,0 


0-1,002,0 


</-/o  0. 


where  we  use  the  fact  that  Pi(l,l)  — >  1  and  Po(0,0)  — s-  1,  as  n  — *  00,  under  the  consistency 
conditions  pk,o  <  rjk  <  Mfc,0  for  the  sensor  thresholds.  Furthermore,  notice  that 


and 


—  In  Px(0, 0)  -  —  In  Po(0, 0)  <  0 
n  n 


—  In  Px(l,  1)  —  —  In  Po(l,  1)  >  0 
n  n 


in  the  last  two  subtests  of  (118),  since  x2  +  y2  -  2 pxy  =  (x  -  py)2  +  (1  -  p2)y 2  =  (y  -  px)2  +  (1  — 
p2)x2  >  0,  for  all  p  G  [—1,1].  Therefore,  the  sensor  decision  pairs  (0,0)  or  (1,1)  always  imply 
that  the  fusion  center  favors  H0  or  H\,  respectively,  in  its  decisions.  To  pursue  the  optimal 
sensor  nonlinearities  and  associated  thresholds  for  the  general  case  of  asymmetric  (unequal) 
sensor  observation  densities  is  a  very  difficult  task.  Instead,  as  in  the  previous  subsections,  we 
restrict  our  attention  to  the  case  of  symmetric  univariate  and  bivariate  densities. 

In  the  symmetric  case,  we  can  easily  check  that  the  conditions  (41)  and  (42)  are  satisfied, 
since  pi  <  1  (i  =  0, 1),  and  the  factors  multiplying  it  in  these  conditions  are  nonnegative,  because 
of  the  consistency  conditions.  Using  the  results  of  Lemma  2  we  obtain  the  fusion  rule 


■  f  j  n  ,  ,  (mi  -m)2  .  (m-m 0?  >Hi  0 

Z(J ^  -£0"q 

(Mi  -  v)2  (M  -  Mo)2  >HJ 

i  ^  9  ^Hq  u 


if  dn  \  —  1,  dn^  —  0, 


2  a2 

if  •fn,!  —  0?  *-f«,2  —  0, 


2a2 


(119) 


(Mi  -  V?  >Hl 


(1  +  M1K2 


<Ho  0 


•  f  ,  _  ,  ,  ,  (M  —  Mo)2  >Hl  n 

if  dn,x  —  1,  un,2  —  1,  \  2  ^Ho  ” 

(1  +  Po)&o 
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where  fj  =  px  =  p2  is  the  threshold  used  by  the  two  sensors.  This  threshold  is  determined  in 
the  manner  described  by  (86)  in  Subsection  III. 4,  that  is  by  maximizing  min{(p  —  p0)2/[(l  + 
Po)°o\i  (Pi  ~  p)2/[(l  +  Pi)ai]}  [the  minimum  of  the  weights  for  sensor  decision  pairs  (0,0)  and 
(1,1)],  except  that  the  final  result  is  now  the  following: 


P  = 


(1  +  Po)°bPl  +  (1  +  Pl)alPO 


(l  +  Po)oO  +  (l  +  /3l)oi 

With  this  threshold,  the  fusion  center  subtests  for  the  various  sensor  decision  pairs  become 

if  -  0.  d„,2  =  1;  4r f  1  (-»1  +  ».)  <H.  o.  i.e.,  -P.+aAhA 


(120) 


i  f  dn,  1  —  1?  2  —  0, 


[(1  +  px)a1  +  (1  +  po)cr0}2 
(pi  -  Po)2(2  +  p\  +  Po) 

[(l  +  p1)«71  +  (l  +  po)o-0]2 

(Pi  -  Po)2 


if  dn?  1  =  0,d„,2  =  0;  - 


if  dn<i  —  l?^n,2  —  1) 


>»i  >«i 

(-Pi  +  Po)  <Ho  0,  i.e.,  -  Pi  +  Po  <Ho  0(121) 


>"1 


[(1  +  pi)ax  +  (1  +  po)o-o]2 
(Pi  ~  Po)2 


[(1  +  P\)a\  +  (1  +  Po)cro]2 
The  error  probability  of  the  fusion  center  takes  the  form 


<H0  0,  i.e.,  decide  H 0 
>"1 

<H0  0,  i.e.,  decide  H\. 


Pe  —  p{-Po(l,  1)  +  I{~P\  +  Po  >  0)Po(0, 1)  +  I(~Pl  +  Po  >  0)Po(l,  0)} 

+(1  -  p){Pi( 0, 0)  +  I(-Pl  +  Po  <  0)Pi(0, 1)  +  J(-Pr  +  Po  <  0)Pi(l,  0)}  (122) 


where  1(A)  is  the  indicator  function  of  A.  To  minimize  this  error  probability,  we  consider  the 
cases  (i)  p\  >  po  and  (ii)  pi  <  po-  Next  we  only  discuss  case  (i). 

In  the  case  p\  >  po,  the  error  probability  of  the  fusion  center  becomes 

Pe  =  pPo(l,l)  +  (l-p)[Pi(0,0)  +  Pi(0,l)  +  P1(l,0)]  (123) 


whose  asymptotic  rate  is  derived  by  using  the  results  of  Lemma  2  as 


- In  Pe 


mm 


(p  -  Po)2  (Pi  -  p)2  (fj  -  po)2  (pi  -  p)2 


n 


2cr2 


as  n 


2 cr\  ’  (l  +  po)^’  (l  +  pi)cr{ 

(p  ~  Po)2  (px  -  p)2 1  (pi  -  Po)2 

k  2(?l  ’  2  (7 2  j  (cr0  +  <Tx)2 

00.  Notice  that  a  different  sensor  threshold,  namely 


=  mm 


(124) 


p  =  ((Topi  +  C7!p0)/((70  +  CTl)  (125) 

maximizes  the  above  asymptotic  rate.  If  we  use  this  threshold  in  the  fusion  rule  of  (119), 
we  obtain  a  fusion  rule  different  from  that  of  (121).  Actually,  since  the  fusion  center  error 
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probability  is  the  ultimate  performance  criterion,  the  above  choice  of  threshold  in  (125)  and  the 
fusion  rule  of  (119)  with  this  threshold  in  place  should  be  preferred  over  any  other  design.  We 
presented  here  the  alternative  design  based  on  the  maximization  of  the  weights  in  the  fusion 
rule  of  (119)  for  the  sake  of  completeness.  The  fact  is  that  no  matter  which  specific  inequalities 
are  satisfied  in  the  various  subtests  of  the  fusion  rule  of  (119)  (because  of  the  choice  of  77),  the 
error  probability  of  the  fusion  center  will  have  the  asymptotic  rate  given  by  (124). 

The  asymptotic  rate  of  (124)  is  lower  bounded  by  (pj  —  po)2/2(cr%  +  af);  this  lower  bound 
becomes  tight  as  of  — ►  Oq.  This  performance  measure  is  preferable,  since  its  maximization  results 
in  a  linear  integral  equation,  as  already  discussed  at  the  beginning  of  Section  III.  Therefore,  the 
sensor  nonlinearity  g  is  determined  as  the  solution  of  the  optimization  problem  described  in 
(57). 

V.  PERFORMANCE  EVALUATION  OF  TWO-SENSOR  DISCRIMINATION  SCHEME 

In  this  section  we  compare  via  simulation  the  performance  of  the  various  quantization  and 
fusion  schemes  which  take  into  account  the  dependence  in  the  sensor  observations  and  employ 
the  optimal  nonlinearities  or  quantizers  to  the  ones  which  ignore  the  dependence.  Moreover, 
we  simulate  and  quantify  the  relative  performance  of  the  four  quantization  and  fusion  schemes 
introduced  and  analyzed  in  the  previous  sections.  The  sensor  observation  processes  are  charac¬ 
terized  by  stationary  lognormal  (under  H\ )  and  Rayleigh  (under  II0 )  univariate  and  bivariate 
(second-order  joint)  densities.  Actually,  a  p-mixing  dependence  model  is  adopted  for  the  sensor 
observations.  These  distributions  under  two  hypotheses  are  motivated  by  practical  problems  in 
naval  target  discrimination,  in  which  H\  models  a  target  (e.g.,  a  ship)  and  Ho  a  decoy  (e.g., 
a  chaff  cloud).  It  is  assumed  that  the  first-  and  second-order  moments  of  the  sensor  observa¬ 
tions  under  Ho  are  equal  to  the  corresponding  ones  under  H 1;  which  makes  the  discrimination 
problem  particularly  difficult.  The  two  hypotheses  are  assumed  to  be  equiprobable,  i.e. ,  p  =  .5. 

A  sample  size  of  n  =  600  is  used  in  all  examples  to  yield  sufficiently  small  error  probabilities. 

1000  simulations  are  run  to  generate  the  numerical  results  in  this  section;  this  is  sufficient  for 
providing  reliable  results,  as  long  as  the  error  probabilities  are  no  smaller  than  .005,  which  is 
the  case  here.  In  our  simulations  the  magnitude  of  the  sensor  observations  is  restricted  to  be 
in  the  interval  [Xmin ,  Xmax] ,  this  is  imposed  from  practical  considerations  involving  the  hard¬ 
ware  of  the  radar  discrimination  systems.  In  this  way,  very  small  and  in  particular  very  large 
samples  (in  the  tails  of  the  pdfs)  are  discarded  and  good  quality  simulation  data  are  fed  to  our 
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quantization  and  fusion  schemes.  The  appropriate  values  of  Xmin  and  Xmax  are  derived  for  the 
specific  sensor  observation  pdfs,  by  requiring  that  Pk,i{X  <  Xmin }  =  Pk,i{X  >  Xmax}  <  e  for 
i  =  0, 1,  k  =  1,2,  and  e  =  10-6.  For  the  lognormal  and  Rayleigh  univariate  and  bivariate  pdfs 
of  interest  in  this  section  the  range  of  observations  turns  out  to  be  (0.02, 16.6).  Finally,  in  all 
examples  involving  quantizers,  we  use  =  M2  =  8,  i.e.,  eight-level  quantizers;  the  number  of 
quantization  levels  was  kept  small  to  reduce  the  complexity  of  the  computation  of  the  optimal 
breakpoints  and  quantization  levels. 

The  lognormal  processes  under  Hi,  for  i  =  1,2,  -  ■■,n  and  k  —  1,2  are  obtained  from 
the  nonlinear  transformations  xj^  =  exp[<7^iv!^  +  j],  in  which  the  Gaussian  processes 

are  generated  from  the  recursion  formula 


<‘1  =  vtf 

=  O&WJL,  +  \A  -  [/4‘iPWj1;  i  >  1 

where  -1  <  <  1  are  correlation  coefficients  and  ( z  =  1,2,-*-,  n),  for  sensor  k  (k  =  1,2), 

is  a  sequence  of  i.i.d.  standard  Gaussian  (jV(0, 1))  random  variables.  In  the  case  of  dependence 
across  time  only,  and  vff  are  generated  independently;  in  the  case  of  dependence  across 
time  and  sensors,  they  are  generated  in  a  coupled  manner  as 


rW 


where  is  another  i.i.d.  standard  Gaussian  (Af(0, 1))  process  and  —  1  <  Pq\  <  1  a  correlation 
parameter;  and  kFlit-  are  mutually  independent. 

The  Rayleigh  processes  under  Hq,  Xq^  are  obtained  from  two  independent  Gaussian  pro¬ 
cesses  Y$  and  45  for  *  =  1,2,.- -,n  and  k  =  1,2  as  X$  =  +  (Z^})\  where 

Yof  and  zjfj  are  generated  by  the  recursions 

y(fc)  _  y(fc) 

■‘0,1  —  K0,l 


and 


r/P) 

^0,1 


r/-(fc) 

u0,l 


7W  _  Jk)  y(k) 
J0,i 


=  p'&zili  + 1/1  -  [pglPC;  ■'  > 1 
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where  —  1  <  p^0  <  1  are  correlation  coefficients  and,  for  sensor  k,  U^}  (i  —  1,  2,  ■  •  •)  are 

two  i.i.d.  standard  Gaussian  (A/"(0, 1))  sequences.  In  the  case  of  dependence  across  time  only, 
V0(?)  are  generated  independently  and  so  are  U$)).  In  the  case  of  dependence 

across  time  and  sensors,  they  are  generated  in  a  coupled  manner  as 


y(2) 

vo,t 


+  and  +  \A  ~  [/> 


,(<=> 


•»(c)  jji1) 


where  Wo,i  and  Wo,i  are  two  i.i.d.  standard  Gaussian  sequences  independent  of  each  other,  and 
—  1  <  /Jq  o  <  1  is  a  correlation  parameter;  U^,  Wo,i>  and  Woj  are  mutually  independent. 

To  simplify  the  presentation  of  the  simulation  results,  we  consider  a  two-sensor/fusion-center 
system  with  sensor  observations  that  have  identical  statistics,  i.e.,  the  first-  and  second-order 
pdfs,  respectively,  of  the  sensor  observations  are  the  same  for  the  two  sensors.  This  implies  that 
the  nonlinearities  and  quantizers  employed  by  the  sensors  are  the  same. 

1.  The  Case  of  Dependence  Across  Time 

Example  1:  This  example  pertains  to  the  Scheme  1  described  in  Subsection  III.  1 :  fusion  of 
unquantized  sensor  test  statistics.  The  values  of  parameters  are  selected  as  —  <7qq  =  2.0 
and  =  p^Q  =  0.9261,  under  Hq\  Vq\  =  =  0.4915,  Pq\  =  p^\  =  0.7982  and 

Pq^  =  Pq\  =  0.9924,  under  H\.  In  the  kernel  of  the  integral  equations  the  infinite  sums  (for  the 
p-mixing  case  treated  here)  were  truncated  to  m^i  =  m2,i  =  300  terms  under  hypothesis  Hx 
and  m^o  =  m2,o  =  30  terms  under  hypothesis  Ho,  for  the  two  sensors.  The  above  values  of  the 
parameters  are  characteristic  of  cases  of  practical  interest  in  radar  discrimination.  According  to 
these  values  of  the  parameters,  the  first-  and  second-order  moments  of  the  observations  under 
Ho  are  the  same  as  the  corresponding  ones  under  H i,  which  makes  the  discrimination  problem 
particularly  difficult.  Since,  under  each  hypothesis,  the  univariate  and  bivariate  densities  of  the 
observations  of  the  two  sensors  are  identical,  the  two  optimal  nonlinearities  have  the  same  form 
9opt(%)-  In  Fig.  l.a,  gopt  and  the  nonlinearity  of  gud{x)  =  ln[/i(z)//o(£)]  (the  i.i.d  likelihood 
ratio  function),  (/,-(x), ;  i  =  0,1,  being  the  univariate  densities  under  H,)  are  presented.  In  Fig. 
l.b,  we  present  the  receiver  operating  characteristics  (ROCs)  of  this  example,  which  show  that 
the  performance  of  gopt  is  superior  to  that  of  gad- 

Example  2:  This  example  pertains  to  Scheme  2  of  Subsection  III.2:  fusion  of  suboptimally 
quantized  sensor  observations.  The  revalent  parameter  values  are  the  same  as  the  ones  in  Exam¬ 
ple  1.  The  corresponding  suboptimal  breakpoints  and  quantization  levels  for  both  quantizers  qud 
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and  qad ,  which  are  induced  from  gopt  and  gm  of  Example  1,  respectively,  are  given  in  Table  1; 
the  resulted  ROCs  are  drawn  in  Fig.  2.  From  the  ROCs  in  Fig.  2,  we  see  that  the  performance 
of  the  quantizer  qopt  is  similar  to  the  one  of  qad  in  this  specific  example;  this  is  due  to  the  low 
number  of  quantization  levels  M\  —  M2  =  8  employed,  as  this  number  increases,  considerable 
improvement  in  the  performance  can  be  achieved. 

Example  3:  This  example  pertains  to  Scheme  3  of  Subsection  III. 3:  fusion  of  optimally  quan¬ 
tized  sensor  observations.  We  use  the  same  parameter  values  as  in  Example  1.  The  optimal 
quantizer  Qopt  (breakpoints  and  levels)  is  shown  in  Table  2,  where  the  quantizer  Qm  obtained 
by  ignoring  the  dependence  is  also  included.  The  resulting  ROCs  are  drawn  in  Fig.  3,  which 
indicates  that  the  scheme  employing  the  optimal  quantizer  provides  superior  performance  to  the 
one  which  ignores  the  dependence. 

Example  4:  This  example  pertains  to  Scheme  4  of  Subsection  III.4:  fusion  of  binary  sensor 
decisions.  The  parameter  values  are  set  to  be  the  same  as  in  Example  1.  In  Fig.  4,  we  compare 
the  ROCs  using  the  nonlinearities  gopt  and  ga j  given  in  Fig.  la.  From  Fig.  4,  we  conclude 
that  the  scheme  employing  gopt  yields  a  performance  superior  to  the  one  which  ignores  the 
dependence. 

Then  we  compare  the  ROCs  of  the  optimal  nonlinearities  or  quantizers  for  the  above  four 
schemes  in  Fig.  4a,  where  we  see  that  the  performance  of  Scheme  1  is  best  (as  intuitively 
expected),  Scheme  4  follows,  then  Scheme  3,  and  the  performance  of  Scheme  2  is  the  worst.  The 
two  schemes  employing  quantized  observations  did  not  perform  that  well  for  this  small  number 
of  quantization  levels  (3-bit  or  8-level  quantizers). 

2.  The  Case  of  Dependence  Across  Time  and  Sensors 

Example  5:  This  example  pertains  to  the  same  scheme  as  in  Example  1.  The  values  of  the 
parameters  /j.q^  and  are  assigned  the  same  as  in  Example  1,  but  here  =  m2,i  =  600 
and  mii0  =  m2jo  =  60  are  used.  In  addition,  we  choose  Pq\  =  0.90  and  p^\  =  0.99.  In  Fig.  5a, 
we  present  the  plots  of  the  nonlinearities  gopt  (which  takes  into  account  the  dependence  across 
time  and  sensors),  gopt  (which  ignores  the  dependence  across  sensors),  and  gm  (which  ignores  the 
dependence  across  time  and  sensors);  gopt  and  gad  have  the  same  forms  as  in  Example  1  since  the 
relevant  parameter  values  are  the  same.  In  Fig.  5b,  we  draw  the  ROCs  of  the  unquantized  fusion 
schemes  employing  these  three  nonlinearities.  From  this  figure,  we  see  that  the  performances  of 
gopt  is  superior  to  that  of  gopt ,  which  is  in  turn  superior  to  that  of  gad- 
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Example  6:  This  example  pertains  to  the  same  scheme  as  in  Example  2.  The  parameter  values 
are  the  same  as  in  Example  5.  The  suboptimal  quantizer  qud  (breakpoints  and  levels),  which  is 
induced  from  gopt  of  Example  5,  is  given  in  Table  3.  In  Fig.  6,  we  draw  the  ROCs  which  use 
the  three  quantizers  qopt ,  qopt  and  qud',  where  qopt  and  qu,{  are  obtained  from  the  gopt  and  gad  as 
in  Example  2.  From  Fig.  6,  we  notice  that  the  performance  of  the  suboptimal  quantizers  qopi  is 
superior  to  that  of  qopt ,  which  in  turn  is  superior  to  that  of  qua- 

Example  7:  This  example  pertains  to  the  same  scheme  as  in  Example  3.  We  use  the  same 
parameter  values  as  in  Example  5.  In  Table  4,  the  quantizer  Q  (breakpoints  and  levels)  resulted 
from  numerical  optimization  is  given.  In  Fig.  7,  we  then  present  the  ROCs  for  the  schemes 
employing  Qopt,  Qopt  and  Qua,  respectively;  Qopt  and  Qud  are  as  defined  in  Example  3.  From 
Fig.  7,  we  observe  that  the  performance  of  Qopt  is  superior  to  that  of  Qopt  and  Qud  and  the 
performance  of  Qopt  is  superior  to  that  of  Qud- 

Example  8:  This  example  pertains  to  the  same  scheme  as  in  Example  4.  The  parameter  values 
are  set  to  be  the  same  as  in  Example  5.  In  Fig.  8,  we  present  the  ROCs  which  uses  gopt  and 
gad  (of  Example  1)  in  this  case.  From  this  figure,  we  see  that  the  performance  of  gopt  is  better 
than  that  of  gad  for  dependent  observations  across  time  and  sensors. 

In  Fig.  8a,  we  provide  the  same  comparison  as  the  one  presented  in  Fig.  4a.  From  this 
figure,  we  conclude  that  in  this  case,  the  performance  of  Scheme  1  is  best,  Scheme  4  follows, 
then  Scheme  3,  and,  finally,  Scheme  2  which  has  the  worst  performance. 

VI.  EXTENSIONS  AND  CONCLUSIONS 

We  conclude  this  paper  by  discussing  some  extensions  of  the  work  presented  in  the  previous 
sections.  First  we  consider  minimax  robust  discrimination  for  an  environment  with  uncertainty 
in  the  sensor  observations,  and  then  a  multi-sensor  system  with  more  than  two  sensors.  Finally, 
conclusions  are  drawn  from  this  work. 

1.  Robustness 

In  the  derivation  of  the  design  criteria  and  the  determination  of  the  optimal  nonlinearities 
and/or  quantizers  for  the  four  schemes  in  Sections  III  and  IV,  the  univariate  and  bivariate 
densities  of  the  sensor  observations  under  the  two  hypotheses  were  assumed  known  a  priori. 
However,  this  assumption  is  not  valid  in  many  practical  situations.  Thus  the  robust  design  of 
the  nonlinearities  and  quantizers  based  on  models  of  uncertainty  in  the  univariate  and  bivariate 
pdfs  is  of  interest.  A  survey  of  robust  single-sensor  detection  techniques  is  given  in  [20].  Here  we 
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pursue  maximin  robust  designs  according  to  which  we  solve  optimization  problems  (involving 
the  design  criteria  of  Sections  III  and  IV)  of  the  form 


(126) 


where  i  =  0,1,  j  —  1,2,  •  •  •,  for  and  k  =  1,2,  for  the  case  of  dependence  across  time 
only;  and 

max  min  J^(g,  //j),  f  Jj))  (127) 

3 

where  i  =  0, 1,  j  =  1, 2,  •  •  •,  m  for  f\3\  and  j  =  0, 1,  •  •  •,  m  for  f\3\  for  the  case  of  dependence 
across  time  and  sensors.  In  these  formulations  the  univariate  and  bivariate  pdfs  belong  to  specific 
uncertainty  classes  as  discussed  in  the  following  paragraph.  The  above  maximin  problems  were 
formulated  for  the  nonlinearities  only,  when  quantizers  are  involved  instead  of  or  in  addition  to 
the  nonlinearities,  we  maximize  with  respect  to  qk  or  (<?a-,%)i  respectively. 

In  the  previous  sections  we  derived  two  basic  distinct  forms  of  objective  functions  for  the 
design  of  optimal  nonlinearities  and  quantizers:  (/qttl  -  g-kfl)2 /{^kfl  +  a2k  x)  ( k  =  1,2)  and 
(/U  -  Po)2/[(l  +  Po)0o  +  (1  +  Pi )crf ] .  The  minimax  robustness  for  the  objective  function  (fik,i  - 
/rfc,o)2/(o'fc]o  +  &k,i)  =  1>2)  has  been  addressed  in  our  work  of  [13]  in  a  single-sensor  context. 

In  [13]  we  modeled  the  uncertainty  in  the  univariate  pdfs  by  Iiuber-Strassen  capacity  classes 
which  include  the  popular  e-contaminated,  total  variation,  band,  and  p-point  classes  (see  [13] 
and  [18]).  For  the  bivariate  pdf  under  hypothesis  Hi  (i  =  0,1)  the  following  uncertainty  class 
was  considered  in  [13] 


sup 

9k 


|  CQT>,-{gfc(XfAr)),gifc(Xff1)}| 
yJvar^g^X^^variig^xf^)} 


<  rkj 


k  =  1,2;  j  —  1,2,  ••■,mk,i 


(128) 


for  all  nonlinearities  gk.  This  class  was  first  introduced  in  [19]  for  robust  single-sensor  weak- 
signal  detection  and  estimation.  Thus  for  the  design  criterion  (gk, l  —  gh,o)2 / +  ^l^ik  =  1,2) 
used  for  Schemes  1  and  2  in  the  case  of  dependence  across  time  (Subsections  III.  1  and  III.2), 
the  least-favorable  univariate  and  bivariate  densities  derived  in  [13]  can  be  applied  directly.  For 
Scheme  4,  since  the  optimal  nonlinearities  in  both  Subsections  III. 4  and  IV.4  are  determined  by 
maximizing  the  same  objective  function  (the  one  described  above),  the  maximin  robust  results 
of  [13]  are  still  applicable.  Unfortunately,  for  Scheme  3  in  both  cases  of  dependence  across  time 
as  well  as  across  time  and  sensors  (Subsections  III. 3  and  IV. 3),  we  were  unable  to  derive  explicit 
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maximin  robust  results  due  to  the  lack  of  closed  form  expressions  for  the  optimal  quantization 
break-points;  further  investigation  is  necessary  here. 

For  the  other  objective  function  (pj  -  Po)2/[(l  +  Po)° o  +  (1  +  Pi)a\}  which  is  employed  as 
the  design  criterion  for  Schemes  1  and  2  in  the  case  of  dependence  across  time  and  sensors 
(Subsections  IV. 1  and  IV. 2),  we  can  exploit  the  similarity  with  the  the  objective  function  (pi  — 
po)2/(a^  +  erf)  of  [13]  ,and  modify  the  formulation  and  results  of  [13]  to  obtain  the  desirable 
maximin  robust  design.  In  particular,  we  use  the  same  uncertainty  classes  for  the  univariate 
pdfs  as  in  [13],  i.e.,  the  Huber-Strassen  classes  mentioned  above,  and  for  the  bivariate  pdfs  we 
require  that  besides  belonging  to  the  class  satisfying  (128)  (for  symmetric  conditions)  they  also 
satisfy 

Cou1-{p(X1(1)),<?(X<2)1)}| 
sup  -  =  — ■  = . . =====  =  == 

3  ^/vari{g(X{11})}vari{g(X^1)} 
under  hypothesis  Hi  (i  =  0, 1)  for  all  nonlinearities  g.  Consequently,  under  the  stationarity 
assumption  we  can  derive  that 

m0  mo 

sup{(l  +  po)oro  +  (l  +  pi>2}  <  1  +  2  roj)  +  fo0)  +  2  ]T  4J)  -var0{g} 

9  _  j=i  i=i 

mi  m  i 

+  1  +  2^  +  f|0)  +  2^7=^  -vari{g}  (130) 

j=i  i= i 

which  is  the  generalization  of  Eq.  (8)  in  theorem  2  of  [19].  Following  procedures  similar  to 
lemmas  1-3  of  [19]  we  can  construct  least-favorable  bivariate  densities  similar  to  those  given  in 
[19].  Then  we  follow  the  procedure  of  [13]  to  derive  the  desirable  maximin  robust  nonlinearities. 
2.  Systems  with  More  than  Two  Sensors 

The  results  for  Schemes  1-4  of  Section  III  in  the  case  of  dependence  across  time  can  be 
directly  extended  to  configurations  with  more  than  two  sensors.  In  a  A'-sensor  environment 
{K  >  2),  it  is  straightforward  to  show  that  the  same  design  criteria  J/.  ( k  —  1,2,  •••,/!')  are 
involved,  because  the  lower  bound  on  the  asymptotic  rates  of  the  error  probabilities  of  the  fusion 
center  is  additive  in  .7^.  Thus  the  optimal  nonlinearities  g^  obtained  in  Section  III  for  Schemes 
1-4  for  k  =  1,2  are  still  applicable  to  the  case  k  —  1,2,  •••,/if  and  are  characterized  by  the 
optimization  problem  (57)  with  Jk  given  by  (55)  for  sensor  k. 

For  the  case  of  dependence  across  time  and  sensors,  however,  even  under  the  symmetric 
conditions  (identical  sensor  observation  statistics  for  different  sensors)  imposed  in  Section  IV, 


j  =  0,1,' 
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coupled  nonlinear  integral  equations  are  involved  in  the  optimization  whose  solution  requires 
numerical  search.  No  explicit  analytic  solutions  can  be  derived  in  this  case. 

3.  Conclusions 

In  this  paper  we  have  designed,  analyzed,  and  simulated  the  performance  of  new  multi¬ 
sensor  fusion  and  quantization  schemes  for  discrimination  (nonlocal  detection)  between  different 
hypotheses  on  the  basis  of  stationary  dependent  observations.  These  schemes  employ  memoryless 
nonlinearities  and  take  partial  advantage  of  the  dependence  in  the  observations  across  time 
and/or  sensors.  For  large  sample  sizes,  our  schemes  are  optimal  within  the  class  of  sensors  using 
memoryless  nonlinearities  and  require  knowledge  of  the  univariate  and  bivariate  pdfs  of  sensor 
observations.  Four  different  schemes  with  varying  degrees  of  optimality  in  the  quantization  and 
fusion  procedures  were  introduced,  analyzed,  and  simulated. 

The  dependence  in  the  sensor  observations  was  characterized  by  one  of  the  mixing  types: 
m-dependent,  ^-mixing,  or  p-mixing.  The  performance  of  the  two-sensor/fusion-center  configu¬ 
ration  was  measured  by  the  error  probabilities  of  the  fusion  center  under  the  two  hypotheses. 

For  three  of  the  quantization/fusion  schemes  treated  in  this  paper  the  generalized  signal-to- 
noise  ratio  (GSNR)  turned  out  to  be  the  appropriate  criterion  as  a  byproduct  of  the  Neyman- 
Pearson  formulation  on  the  fusion  center  error  probabilities.  For  the  fourth  scheme  the  asymp¬ 
totic  rate  of  the  error  probabilities  of  the  fusion  center  was  derived  as  the  performance  measure 
and  this  also  resulted  in  the  GSNR  as  the  criterion  for  the  final  optimization.  For  the  case  of 
dependence  across  time  and  sensors  a  modified  GSNR  which  involves  the  correlation  coefficients 
of  the  test  statistics  of  the  two  sensors  under  the  two  hypotheses  was  derived. 

Optimizing  with  respect  to  the  nonlinearities  and/or  quantizers  in  the  different  cases  has 
led  to  uncoupled  or  coupled  linear  integral  equations  involving  the  univariate  and  bivariate  pdfs 
of  the  sensor  observations.  In  the  case  of  dependence  across  time  only  the  resulting  uncoupled 
linear  integral  equations  are  readily  solved.  In  the  case  of  dependence  across  time  and  sensors 
the  resulting  coupled  nonlinear  integral  equations  are  difficult  to  solve;  however,  for  the  special 
case  of  identical  sensor  statistics  they  reduce  to  a  single  linear  integral  equation  which  is  readily 
solved. 

Comparison  of  the  performance  of  the  various  schemes  via  simulation  establishes  that,  as 
expected,  fusing  the  un quantized  test  statistics  is  superior  to  all  schemes,  and  fusing  binary 
(hard)  decisions  is  superior  to  fusing  quantized  observations  (at  least  for  small  numbers  of 
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quantization  levels).  Optimal  quantization  is  superior  to  uniformly-spaced  discretization  of 
the  continuous  nonlinearities.  Moreover,  two-sensor  discrimination  schemes  which  take  into 
consideration  the  dependence  across  time  and  sensors  outperform  the  corresponding  schemes 
which  ignore  the  dependence  across  sensors,  and  these  schemes  outperform  by  a  large  margin 
the  corresponding  schemes  which  ignore  the  dependence  across  time  and  sensors. 

Maximin  robustification  of  most  of  the  proposed  two-sensor  discrimination  schemes  against 
uncertainties  in  the  univariate  and  bivariate  pdfs  of  the  observation  processes  can  be  carried  out 
by  applying  existing  results  from  single-sensor  systems  to  the  case  of  dependence  across  time 
and  the  one  of  symmetric  densities  for  dependence  across  time  and  sensors.  In  addition,  in  the 
case  of  dependence  across  time,  the  proposed  discrimination  schemes  can  be  directly  extended 
to  configurations  involving  more  than  two  sensors.  In  the  case  of  dependence  across  time  and 
sensors,  the  basic  methodology  is  still  applicable,  but  the  solution  of  nonlinear  integral  equations 
is  required  for  the  optimal  nonlinearities  and/or  quantizers. 
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Appendix  A:  Proofs  of  Lemma  1  and  Theorem  2 

Proof  of  Lemma  1:  By  using  arguments  similar  to  the  ones  used  in  the  proof  of  theorem  2  in 
[12],  we  write  the  following  Taylor  series  expansion  for  6,(s) 


bAs )  =  lim  —  Pjln  Ln]s  -f  o(s ) 

n— *-co  n 

where  o(s)  represents  high-order  terms  with  the  property  o(s)/s  — *  0  as  s  — *  0.  Thus,  as  n  — ►  oo 

bi(s)  -»■  <p{s 

according  to  (12)  and  (13).  Therefore,  from  the  definitions  of  Bs ,  Ii{z)  and  s; 

Ii(z )  =  SUp{,S2  -  6j(s)}  =  sup{(2  -  <pt)s}  =  (2  - 
s£B  s£B 

Moreover,  since  I{{z)  is  the  supreme  of  sz  —  5,(s)  on  the  set  Bs,  we  can  always  choose  the  sign 
of  Si  such  that  Ii(z)  is  positive  for  any  2. 

Proof  of  Theorem  2:  We  first  show  the  proof  of  the  asymptotic  rate  for  Po(ln  Ln  >  nr}).  Let 
2  =  ~  In  Ln  and  define  the  sets  G  and  G  as 

G  =  {In  Ln  €  R  :  In  Ln  >  nr]}  =  {2  £  R  :  2  >  t]} 

and 

G  =  {In  Ln  £  R  :  ln  Ln  >  nr]}  =  {z  6  R  :  z  >  77}, 

in  addition,  let  G  be  the  closure  of  G.  Then  from  the  definition  of  J,(G),  we  have  J,(G)  = 
Ii(G)  =  7,(G).  We  also  have  Po(G)  <  Po(G)  <  Po(G)  since  G  C  G  C  G.  Then  from  Theorem  1, 
we  have 

lim  inf  — —  ln  Po(ln  Ln  >  nr/)  =  lim  inf  —  —  lnPo(G) 

71 — ►oo  77,  n— >00  72 

>  lim  inf  —  —  lnPo(G) 

n— ►  00  72 

>  J,(G)  =  I,(G) 

>  lim  sup  — ilnPo(G) 

n— ►  00  Tl 

=  lim  sup  — —  lnPo(lnP„  >  7177) 

n — kx)  Tfl 

since  {In  Ln  :  ln  Ln  >  nr]}  is  an  open  subset  of  R.  On  the  other  hand, 

lim  sup  — ilnPo(lnP„  >  tit?)  >  lim  inf - lnPo(lnP„  >  nr/). 

n—*<x>  n  n^°°  n 
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Consequently, 


lim - lnP0(lnin  >  nil)  =  lAz  >  77). 

Tt— ►<> O  n  '  ' 

The  above  results  were  stated  and  used  in  [17];  a  similar  formulation  appeared  in  [12].  We  show 
them  here  for  completeness.  Therefore, 

lim - In  P0(ln  Ln  >  mj)  —  Ii(z  >  77)  >  It(z  >  77)  =  inf  I{(z)  =  inf  {z  -  <Pq)sq 

71^00  U  {z>7)}  {z>77> 

where  the  last  inequality  is  from  (26)  and  the  last  equality  is  from  Lemma  1  for  i  —  0.  Under 
the  consistency  condition  (p0  <  77  <  cpi,  we  have 


inf  (z  -  9o)50  =  (»?  -  <Po)sq 
{z>v} 

where  because  77  —  >  0,  s0  is  set  to  be  positive  according  to  Lemma  1.  Following  steps  similar 

to  the  above  ones  we  can  obtain 

lim - ln  Pi  (In  Ln  <  nri)  >  inf  (z  —  (pi)si  =  (tj  —  (fii)si 

n-*oo  n  ~  "  “  {2<,,}V  ^  '  V  r  ' 

where  because  77  —  <px  <0  under  the  consistent  condition,  Si  is  set  to  be  negative.  Equivalently, 
we  have 

lim  lnP^lnin  <  nr])  >  (<pi  -  77)|5i|. 

n— *»oo  ji 

Since  both  S,-  are  small  numbers  near  zero,  we  define 

s  =  min { 50,  |Si|} 


and  thus 


lim  —  ln  Po(ln  Ln  >  nr])  >  (77  —  tp0)s 

n — hdo  71  ' 


and 

lim  — —  lnP!(lnPn  .<  nn)  >  (o?i  —  rj)s 

71— >CO  Jl 

which  are  the  two  inequalities  in  (27).  Finally,  (29)  with  7m,-n  defined  by  (28)  results  from  (15) 

Appendix  B:  Maximization  of  (53) 

It  is  easily  to  check  that  Jk  is  invariant  under  the  scaling  of  gk ■  Consequently,  the  maxi¬ 
mization  problem  characterized  by  (53)  is  equivalent  to  maximizing 


H(gk)  =  fik,  1  -  Pfc, 0  +  A 


+ 


>k,  0 


a 


k,  1 
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where  A  is  the  Lagrange  multipier.  The  necessary  condition  of  the  maximization  of  H(gk)  is 


YeIJ((Jk  +eS9h) \e=0  =  0 


where 

dH(gk  +  e6gk). 
de  ' 


:=o  =  J  6gk(x)dx  {A, i(x)  -  A,o(z) 

.  "H,o 

9k(x)fk, o(*)  +  J _  A,o(*)A,o(y)]  ~  fk,o(x)fkfi{y)\gk(y)dy 


— 2A 


1 

+  — r 


'*,1 


77ifc  1 

fiffc(x)/fc,i(^)+/[2Xi[/2(a:5y)-A,i(;c)A,i(2/)]-/fc,i(a;)AM(S')]s,fc(2/)d2/ 

i=i 


-J  | A,i(x)  -  A,o(x)  -  2A 


A,iO)  ,  A,o(x) 


(7 


M 


'Ar,0  J 


9k(x) 


+ 


/ 


Kk,i{x,y)  Kk,o(x,y) 

A  T  d 


’k,  1 


'k,  0 


9k{y)dy  >  >  Sgk{x)dx 


with  I(k,i(x,y)  defined  by 


mk,i 

KkAx’y)  = 2  £[/$(*,  i0-  fkAx)fk,i(y)}  -  fkAx)fkAy)- 

j- 1 


This  leads  to  the  following  nonlinear  integral  equation 


A,i(x)  -  A,oO)  . ,  f 

-  Sk(x)  +  J 


A,  1(3*)  +  A,o(^) 


Kk,i(x,y)  Kkflfay) 


a 


!,i(flfc)  fffc.oU*)  . 


9k(y)dy 


Unfortunately,  we  could  not  obtain  a  sufficient  condition  for  this  optimization  problem.  Thus, 
although  the  above  nonlinear  integral  equation  can  be  solved  via  numerical  techniques,  we  have 
no  way  of  guaranteeing  the  optimality  of  the  solution. 

Appendix  C:  Maximization  of  (99) 

We  can  easily  verify  that  J  is  invariant  under  the  scaling  of  g.  By  using  similar  arguments 
as  in  Appendix  B,  the  maximization  of  (99)  is  equivalent  to  the  maximization  of 

H{g)  =  Pi  —  Po  +  A[(l  +  Pi)ai  +  (1  +  Po)<tq) 
where  A  is  the  Lagrange  multiplier  and,  for  i  —  0, 1, 


E?.i  sfv,-1’)  -  in]  [£?=,  g(xn  -  m 


(2)x 


Pi<jf  —  Ei 


y/n 


sfn 


nit 

£i{j(vj1))9(xf))}  -  +  2  -  ft?] 

i= 1 


1 


► 

/ 
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The  necessary  condition  for  optimal  g  is  given  by 

dH(g  +  eSg) 
de 

which  leads  to  the  linear  integral  equation 


0 


Ji(a0  /o(*)  T  2Ag(x)[/1(x)  +  fo(x)] 


^  j  "U  rhi 

E  +2  ’(^^-/.(^(y)]  +  [f(x,|/)-2/I(x)/I(y)] 

,=o  l  i=1  j= i 


+2A  | 

The  sufficient  condition  is  given  by 


32i7(<7  +  e^) 

ch2  1 


;=0  <  0. 


and  is  satisfied  if  A  <  0.  Actually,  we  set  A  =  —1/2  in  order  to  be  consistent  with  the  single¬ 
sensor  discrimination  formulation  of  [12]-[13].  Finally,  the  optimal  g  is  obtained  by  solving  the 
linear  integral  equation  given  by  (100)  with  the  kernel  defined  by  (101). 
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Phenomenon 


Phenomenon 


Fig.  C.2  Fusion  of  Quantized  Sensor  Observations  (  Schemes2  and  3  ) 


Phenomenon 


Fig.  C.3  Fusion  of  Binary  Sensor  Decisions  (  Scheme  4  ) 


ROCs 


Fig.  4a  Comparison  of  ROCs  for  Schemes  1—4  in  the  Case 

of  Dependence  Across  Time 


ROCs 


Fig.  8  Comparison  of  ROCs  in  Example  8 


Using  giid(x) 


Fig.  8a  Comparison  of  ROCs  for  Schemes  1—4  in  the  Case 
of  Dependence  Across  Time  and  Sensors 


qopt:  induced  from  gopt 


qud-  induced  from  giid  =  ln(fi/f0) 


I_ L 


breakpoints 

levels 

breakpoints 

levels 

0.02 

-0.4546 

0.02 

-18.3392 

2.0925 

-0.1320  xl0“2 

2.0925 

-0.4213  xlO"1 

4.1650 

-0.1223  XlO-2 

4.1650 

-0.7578  xlO-1 

6.2375 

0.5736  xlO"2 

6.2375 

1.0728 

8.3100 

0.7971  XlO-2 

8.3100 

3.5113 

10.3825 

0.6954  xlO-2 

10.3825 

7.2116 

12.4550 

0.6759  xlO"2 

12.4550 

12.1400 

14.5275 

0.1441  xlO-1 

14.5275 

18.2693 

Table  1:  8-level  Suoptimal  Quantizers  in  Example  2 


Q  opt 

Q  iid 

breakpoints 

levels 

breckpoints 

levels 

0.02 

0.9894 

0.02 

0.5766 

0.2488 

0.8298 

0.4398 

-0.3341 

0.5059 

-0.2231 

0.5889 

-0.4371  xlO-1 

0.7631 

0.1629 

0.7384 

0.2217 

1.0148 

0.4922 

0.9138 

0.4045 

1.2643 

0.4816 

1.0867 

0.5478 

1.5024 

0.5263 

1.5444 

0.5317 

1.7387 

0.3725 

2.6292 

0.3401 

Table  2:  8-level  Optimal  Quantizers  in  Example  3 


qopt:  induced  from  gopt 

breakpoints 

levels 

0.02 

-6.0116 

2.0925 

-0.4211  XlO-2 

4.1650 

-0.7475  xlO-2 

6.2375 

0.3335  xlO"1 

8.3100 

0.8737  XlO"1 

10.3825 

0.2609 

12.4550 

0.3970 

14.5275 

3.9601 

Table  3:  8-level  Suboptimal  Optimal  Quantizer  in  Example  6 


Table  4:  8-level  Optimal  Quantizer  in  Example  7 


